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SUMS OF SQUARES OF POLYNOMIALS 
By LreonarD CARLITZ 


1. Introduction. In this note we determine the number of representations 
of 0 as the sum of an arbitrary number of squares of polynomials in a single 
indeterminate with coefficients in a fixed Galois field GF(p"), p > 2. More 
accurately, if a; , --- , a, are t non-zero elements of GF(p"), a: + --- + a = 0, 
we determine the number of solutions of 


(1.1) 0 = mYi+---+a¥3 


in primary’ polynomials Y ; each of degree k, an assigned positive integer. We 
denote the number of solutions of (1.1) by 


NO) = Ni (0). 
The more general equation 


(1.2) aG = mYi+.---+a¥i, 


where aG # 0, G of degree S 2k, has been treated in two papers, one on the 
case t even, the other on the case t odd.’ In the latter paper a formula for N»2,(0) 
appeared incidentally. We shall derive this formula anew by the simpler and 
direct method used in the paper on ¢ even. 

To evaluate Ne2,,,(0), we make use of a known formula for N2,(G), the number 


of solutions of (1.2) fort = 2s. Applying this formula, we first evaluate the sums 


N2.(G) N2.(G’) 
a Ci 





extended over all primary G; the latter sum leads at once to the determination 
of Noss1(0). 


2. Determination of N»2,(0). In equation (1.2), let ¢ = 28,a = a@+--- 
+ a, ~ 0, so that G is of degree 2k. Assume further 


(2.1) Yi = Qi + a; ~ O (@¢ = 1,---,8). 


Received October 27, 1936. 

1A polynomial is primary if the coefficient of the highest power of the indeterminate 
is the unit element of the Galois field. The capitals A, B, E, G, M, U, V, Y will denote 
primary polynomials. 

2 The even case in Transactions of the American Mathematical Society, vol. 35 (1933), 
pp. 397-410; the odd case in this Journal, vol. 1 (1935), pp. 298-315. These papers will be 
cited as I and II, respectively. 
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2 LEONARD CARLITZ 


Then by Theorem 4 of paper I, the number of solutions of (1.2) is determined by 


m>k m=k 
(2.2) N2,(G) = (1 ~ =i) i's [M|" + Dox™| Mi. 
M MG 


Here x = +1 or —1 according as 
(2.3) (—1)* ayaz - ++ ae, 


is a square or a non-square in GF(p"); m is the degree of M, | M| = p””; the 
summations are extended over all M dividing G of degree m > k and m = k, 
respectively. If now we denote the right member of (2.2) by A,(G, x), we 
shall prove the following formula: 


; AAG, xJAAG, x2) 
deg G=2 


(2.4) ha 
x u n(2k—u)(st+it+l)) nu k  nk(s+t+2) 
= (1 — =e) > x" p” ( p'“+x Pp ( 


=(0 
where x = x:x2, and the summation in the left member is over all primary G 
of degree 2k. 
The proof of this formula is similar to the proof of Theorem 2 of paper I, 
and therefore only a sketch of it will be given. By means of (2.2), the left 
member of (2.4) may be put in the form 


(-0-AE (AE 


u ul > 18 rit 

+(-E) Et Ea rivi 
om 
v>k k 
where each summation is over all A, B, U, V of degree a, b, u, v, respectively, 
such that 
AU = BV, atu=b+v0 = 2k, 

and in addition satisfies the conditions under each Y. Call the sums 2, 22, 
>; , X4, respectively. Then 


b= 2 xe Ul |v’ 


AU=BV 
u, wr 
(s+ by} —s | — 
= ” acetates x1 x2 | 4 | *| B| t 
AU=BV 
ab<k 
2nk(s+t) j~o-g 
=p wate ; x” | M | Su, 
deg M<k 
where 
-y a _b |—8 jt 
s Su = > xix2/A | | B| I 1. 
(2.5) (A,B)=1 AU=BV 


ab<k—m atu=b+v=2k—-m 














SUMS OF SQUARES OF POLYNOMIALS 3 


Here the outer sum is over all A, B of degree a,b < k — msuch that (A, B) = 1, 
while the inner sum is over all U, V of degree 2k — m — a, 2k — m — b, respec- 
tively, for which AU = BV. Since (A, B) = 1, it follows that A | V and B| U. 
For the moment put V = AE, U = BE, so that deg E = 2k — m — a — b. 
Thus for fixed A, B there are | E | choices of U, V satisfying the conditions men- 
tioned. In other words, the inner sum in (2.5) = p"“~""*~”, and therefore 
(2.6) , = ” pani fa er 1. b), 
a,b<k—m 

where’ ¥(a, b) denotes the number of pairs A, B of degree a, b, respectively, 
such that (A, B) = 1. The sum in (2.6) is easily evaluated but this need not 
be repeated here. It is sufficient to notice that comparing this point in the 
present proof with the corresponding point* in the proof of Theorem 2 of paper I, 
we are at once able to conclude that the left member of (2.4) 

iin ” acciices a x” | M ‘ie 


deg M=k 


x 2nk(s+t+1) m —e—t—1 
+ (1 - seas) P woe, | M 
ki 


k  nk(s+t+2) x n(2k—m)(s+t+1) nm 
= xp oa (1 — —— dx"? yy”. 
We have therefore proved (2.4). 
It is now easy to evaluate N»2,(0). Since the conditions (2.1) are assumed to 
hold, we have in particular a = a, + --- + a2 # 0. Then clearly all solu- 
tions of 


0 = a Yj + aie + 2s Y3. (deg Y; = k) 
may be obtained by pairing the solutions of 
aG = mY +--+ + an 2Vo2 (deg Y; = k) 
with those of 
—aG = a1 ¥ 36-1 + a2, ¥35 (deg Y; = k) 


in all possible ways, and allowing G to range over all polynomials of degree 2k. 
We define x; as +1 or —1 according as (—1)* aya - - - a._2 is or is not a square 
in GF(p"); x2 is defined in the same way with respect to —ae,;a2,. Then, 
by the remark just made, it follows that 

N2,(0) = ae Now-2(G)N2(G), 


deg G=2k 
and by (2.2), the right member 
= _ As_o(G, x1) Ao(G, x2). 


deg G=2k 


3 See I, p. 399. 
‘1, p. 405 bottom; ef. also proof of I. Theorem 1. 
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We may now apply (2.4), and we have at once 


k-1 
nks x m n —m)\s—1) nm 
(2.7) N2,(0) =p . “be (1 - ok) 2x P res Pp . 
Thus we have the following 
TuHeoreM A. If a, --- , a2, are 2s non-zero elements of GF(p"), such that 
a + --- + a, = 0, while 


Q2;-1) + a; ~ 0 (¢ = 1,---,8), 
then the number of solutions of a,Y{ + --- + ax.¥3, = 0 in primary Y; of degree k 


is furnished by (2.7) together with (2.3). 


3. The sum 2N2,(G)|G|~“. In (2.2) G is assumed to be of even degree. 
However, if we rewrite the equation in the following form 


m>u m=u 
(3.1) N2;(G) = (1 _ ts) 2 x ier s+ 2 xii, 
pre) ] giv G=MU 
then N2,(G@) is defined for all G (for G of odd degree the second sum on the right 
is vacuous). We may now consider the sum YN2,(G) |G |~" taken over all 
primary G. Thus substituting from (3.1), we have 


No(G) _ os x"|M\"* nas) yr x" | M | 
(3.2) » iG|’ a >i |MU| + — x") Zz “|MUP ’ 








where the sums on the right are over all M, U of degree m, u, respectively, satis- 
fying the conditions indicated. From this it is clear that the right member of 
(3.2) may be reduced to 





) 2 ~ 
ym "a liaudatias + (1 .* xp""”) > ” anion > "lad 
m=0 u=0 m=utl 
1 ‘ , “ wine 
es n(1—s u , nu(s+1—2w) 
ad 1 a Kprieti—tw) + (1 ai XP ) >> x P 1 aie xp" 
1 i xp" xp" 


= - 


1 ~ pret) 








cans xpreti—2w) = xp)’ 


from which it follows that 


1 n(1—w) 


" N2:(G) _— P 
(3.3) > |G\e ss (1 ome xpreti-2w)) (1 a xpre~) ° 





If we split the right member into partial fractions, (3.3) becomes 


> N2.(@) - 1 ” lind 


G 14" ~ o- xpre-) ae eo x pr(eti—2u) ° 


(3.4) 
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From this it is easy to evaluate }> No,(@), summed over all G of fixed degree. 
For even degree we have the following simple formula: 


(3.5) > N.(G) = 


deg G=2k 


’ 


while for odd degree, 


(3.6) D2 N2,(G) _ — Pe xp** p”. 
deg G=2k+1 
The sum formula for even degree (3.5) may be derived in a slightly more direct 
manner. We consider the sum )> N2,(@) | @|~" taken over G@ of even degree 
only, so that Ne,(@) has its original meaning. Then as above, 


s—l m s—l 
N2(G) _ => x” x | s M = ahs (1 es xp ” Maslin me x | M | 


deg G even “14 G je m=ui | “|MU\ |w m>u | MU \" 


m—u even 


m nm(s+1—2u n(1l—s nu(l—w) m_ nm(s—w) 
p + (1— xp""”) Dp > x"? 





M 





m=0 ~~~ § — ae 
bed 2n(s—w) 
os ee + Ee es u pw 
_ 1 at ” — xp" Pei 
= 1 — xprot2w) — xpren- 2w) ee pene)? 
from which we have at once 
N2(G) _ 1 


3.7 a 
, deg Geven | G wa 1 én pene) 


this agrees with (3.5). 


Comparing (3.7) with (3.4), we have 


n(s—w) n(1—w 
Ne(G) _ xp p 1—w) 
deg G odd 1G i” ~ 4] — prnte—w) = ee xpreti-2w)’ 





which is equivalent to (3.6). 


4. The sum >> N2,(G’) |G)". The evaluation of this sum is somewhat 
more elaborate than that of the sum (3.2). If we put G = M)U>, and write 
(Mo, Uo) = D, then clearly My/D and U,/D are squares; we may therefore 
suppose that 





G’ = DM’.DU’, G = DMU, (M, U) = 
Then substituting from (2.2), we now have 
N2,.(G’) ” ian d | DM’ = , d | DM’ a 
4.1 =(1— [DM | 
0 Doge = ) 2® tpMupt 2.* [DMP 
peg m=u 
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where the sum on the left is over all G, while the sums on the right are over all 
D, M, U of respective degree d, m, u satisfying the condition under each sa 
and in addition (M, U) = 1. We may evidently put the right member of (4.1) 
in the form 


“ . T 
4.2 d _ nd(s—w) T = = P 
( ) p> x Pp Sins xp" ? 
where 
(4.3) T _ >> M pars U + (1 — xp"" “8 ) : | M ‘ioe U tee" 
(M,U)=1 (M,U)=1 
m=u m>u 


As for the first sum in (4.3), for ¥(a, b) as defined in (2.6), it is clear that 


io) 
| M ‘ie U ‘> as > spares ”) Y(m, m), 
(M,U)=1 m=0 


mu 


1 of (1 us p") tm —— 2w) 
1 


n(2s—1—2w) 


1—p 
(4.4) = Tx peerter 
The calculation of the second sum in (4.3) is somewhat longer. In the first 
place 
[MPP rule = > pr  (m, u). 


(M,U)=1 m>u 
m>u 


The terms in which u = 0 contribute 





x n(2s—1—w) 
~ nm(2s—2—w) nm __ P 
(4.5) au P P bas = prae-i—w) 5 
For the remaining terms, we have 
nm (2s—l—w 1—w) ~ 
nm(2s met (1 — ") 
m>u>0d 
«eo «x 
~- —w nm(2s—1—w) 
i ‘al Ps p » pr" w) p m(2s—1 
n(4s—1—3w) 
aa p 
(4.6) = (1 - P ) ( — pres-2w)) (J rs prs-1—w))? 
by an easy calculation. Combining (4.4) and (4.5), we see that the second sum 
in (4.3) 
n(2s—1—w) n(4s—2—3u 


oe ak 
el sl pres-2v)) (1 re pr(te—i—w)) ° 
Substituting from (4.4) and (4.5) in (4.3), we may verify that 
a (1 e% xp") (1 a asides 











= fal ex pr2s-2w)) (] — praei—w)y * 
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Therefore, returning to (4.2), we see finally that 


N,,(G’) l—p 
(4.7) > |G\v me al ‘a pris2w)) (J “3 prize—i—w)) * 


n(2s—1—2w) 








5. Determination of N2,,,(0). Making use of the formula just proved, we 
shall now evaluate Ne,.:(0). Let a,, ---+ , aes; be 2s + 1 non-zero elements of 
GF(p") such that a; + --- + a4, = 0. We may assume them so numbered 
that the conditions (2.1) are satisfied,’ and therefore N2,(G) applies and is deter- 
mined by (2.2). Now the number of solutions of 


0 = oY} tere t or0041V $041 (deg Y; = k) 


is clearly the sum of the number of solutions of 


— O251G° = a,Y° +--+ a2, ¥'e, (deg Y; = &), 
extended over all G of degree k. In other words, 
(5.1) Nos41(0) = aia N2,(G*). 
But the sum in the right member is precisely the coefficient of p-“” in the 


expansion of 





. N2(G) wo 1 aca 
(5.2) FB _ G _ — ) re og N 2,(G ). 
G ;uy k=0 DP deg G=k 
Comparing (5.2) with (4.7), it is clear that the quantity in question is the co- 
efficient of p ™” in 
(1 = — asset z ” ee Qiu ~~ 1) j—njw 
i,7=0 


aon (1 os ” gicaliaiiaay p> "aia + oe p + eo p™ 4. aes bap 


Therefore, finally 


(5.3) No,41(0) = po lk 4. (1 = p ") ” si ee, 
0<2i<k 

For k < 2, this reduces to the first term: N24,(0) = p"“”*. According to 
(5.3), Nesii(0) depends only on s and k; in particular it is independent of the 
numbering of the a; required at the beginning of this section. We may now 
state 

TueoreM B. If a, --+ , 241 are 2s + 1 non-zero elements of GF(p"), such 
that a, + --+ + aes; = 0, then the number of solutions of mm ee ee ee) ce 
= 0 in primary Y ; of degree k is furnished by (5.3). 


Duke UNIVERSITY. 


5 See II, §3. 








THE DEGREES OF THE IRREDUCIBLE COMPONENTS OF SIMPLY 
TRANSITIVE PERMUTATION GROUPS 


By J. SUTHERLAND FRAME 


1. In a study of certain hyperorthogonal groups,’ there arose the problem of 
splitting into its r irreducible components a simply transitive permutation 
group G* of degree n and order g, which, when written in matrix form, gave an 
isomorphic representation of the given abstract group as a group G of linear 
transformations. In any simply transitive permutation group, the subgroup 
leaving one symbol invariant will permute the remaining symbols in \ = r — 1 
sets of transitivity of k,, ke, ---, ky symbols respectively. Let the distinct 
irreducible components of the group G have the degrees mp = 1, m, ---, my’, 
and note that r = A + 1 is the sum of the squares of the multiplicities with 
which these occur in the reduction of G.2 When the components are all distinct, 
and \’ = X, there appears to be a simple relation between the product K = 
1-k, kp --- ky and the product N = 1-mm2--- m. 

Consecturep THEorEM I. n'K/N is an integer when the components of G 
are distinct, and this is a perfect square R® when the nuinbers k; are distinct. 

We shall prove the theorem for all groups for which \ S 3, (here \’ must 
equal \), and for an infinite family of groups including all values of 4. When 
\ = 1, the group G* is doubly transitive and n, = ki = N = K = n — 1,s0 
the result is trivial. When \ = 2, our theorem gives us a diophantine equation, 
nkike/nynz = R’, which, with n, + ne = n — 1, enables us to solve for the un- 
knowns 7 and ne. 

To illustrate the application of the theorem, before passing to the details 
of the proof, we take as an example the case of the hyperorthogonal groups, 
where the problem of this paper was suggested.’ We have here a permutation 
group of degree QnQm—1/Qe (where Q,, = q” — (—1)”",q = p’, p prime) which 
is known to have 3 irreducible components. We know also that k, = q°"™*, ke = 
FQ m—2Qm—s/Qe. Hence, nme = 9" QnQm—1Qm—2Qm—s/Q2R’, where R is an in- 
teger, and m + me = nm — 1 = (QnQm+ — Qe)/Qe. The degree of n; or ne as 
a polynomial in qg is 2m — 3, that of the other being less; so the degree of R in q 
is at least m — 2. Since mz is divisible by an odd power of q, and n; + ne is 
divisible by q’, it follows that m , say, is divisible by q° but not q°, and nz by q° 
or some higher odd power. R, not being divisible by ¢”*, must contain a factor 


Received September 17, 1936. 

1J.S. Frame, Unitdére Matrizen in Galoisfeldern, Commentarii Mathematici Helvetici, 
vol. 7 (1935), pp. 97, 98. 

J.S. Frame, The simple group of order 25920, this Journal, vol. 2 (1936), p. 477. 

2 W. Burnside, On the complete reduction of ony transitive permutation group, Proc. Lond. 
Math. Soc., ser. 2, vol. 3 (1905), p. 239. 
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whose square divides QnQm—1Qm—2Qm—s/Q2. The only such factor is Q;, or pos- 
sibly Q; for certain values of m. The latter possibility does not work, and we 
find R = q” *Q:, My = FQnQm—s/QQ:, ne = FQ in1Qm—-2/Q2Qr . The degrees 
of the irreducible components are hereby uniquely determined from our formula. 
In attacking the proof of the theorem, we shall first study in §2 the ring of 
matrices permutable with the group G, and then the secular equation, in terms 
of whose roots p; the constants n; may be expressed. In §3 we shall give the 
proof of the theorem for the cases \ = 2,’ = 3. In §4 we shall prove it for a 
certain infinite family of groups including all values of \ by solving explicitly 
for the constants n;. A lemma on binomial coefficients will be proved in §5. 
zr, 


2. To each permutation e ) of the group G*, there corresponds in G a matrix 


of degree n, (643), Where a, 8 = 1, 2, ---,, which may be thought of as trans- 
forming the space of the variables x; --- z,. Let the subgroup G; of G leave 2; 
invariant and permute the remaining variables in \ = r — 1 sets of transitivity 
T,, of K, variables respectively, 4 = 1, ---,A. Then, setting ko = 1, we have 


(2.1) . > ki = n. 


Now if x, be any variable from the set 7; , and we transform the subscripts 
of the product 7:2, by each permutation S of G*, so that S transforms z, into 


Sa, and #, into S% = Sz, then the sum (nk;,/g) 7 (S2,)(Sz,) is an invariant 
Sing 
hermitian form H,. Its non-vanishing coefficients are all 1, since the subgroup 


leaving zx; and z, both invariant is of index nk, under G. Its matrix of co- 
efficients V, is one of the \ + 1 basis elements of the ring of matrices V which 
are permutable with each of the permutations of G. The conjugate imaginary 
form H, = Hz» is also invariant, and so the corresponding matrix V,», the 
transposed of V,, is in the ring. In particular, Ho is the unit hermitian form, 


aN 
and V>) = J the identity matrix. Furthermore, W = ; V, is a matrix con- 
t=0 
sisting entirely of 1’s. 
r 
Let V = >> a,V; be an arbitrary matrix of the ring, and let products be given 


1=0 


by the formulas 


r 
(2.2) ViV; = im CijeVs 
s=0 
PN 
(2.3) VV; cS } ® C.V., where Cis = Yn OG Cijs « 
s=0 1=0 


37. Schur, Zur Theorie der einfach transitiven Permutationsgruppen, Berlin, Sitzungs- 
berichte (1933), pp. 598-623. Schur has developed several of the properties of this matrix 
ring, applying it particularly to the study of permutation groups having as a subgroup a 
regular group of the same degree. We assume no such subgroup here—in fact, none exists 
for the simple groups in which we are particularly interested. 
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Since V;W = k,W, we have 


IN 
(2.4) D> Cie = hi. 

j=0 

In order to display the reducibility of G, we must change the variables in 

such a way that the invariant hermitian forms H, become diagonal forms 
S'H.S on m variables respectively, ¢ = 0, 1,---,. The matrices V, will 
then also be brought into diagonal form S”~'V,S, and can be written as linear 
combinations of new basis elements M; , each of which is merely a multiplication 
on a set of n; of the new variables. 


r 
(2.5) MM; = piMi, M = S'VS, pi = D> pre. 
t=0 
Since, when we choose the parameters a, so that M = M;, we have M;M; = 


piM; = 0, but M;M; = p;M; # 0, no two multipliers p; and p; corresponding 
to different M; and M; can be identical in the a, . Each multiplier 9; is a root 
of multiplicity n; of det(V — pI) = 0. More simply, it is one of the \ + 1 
distinct roots of the secular equation 


(2.6)  D(p) = det(Ci; — péi;) = (0 — p)(or — p) --+ (px — p) = 0. 
By adding all the other rows to any particular row of the determinant D(p), 


we readily verify by (2.3) and (2.4) that 


N N 
(2.7) po = > kway = > Ci; 
1=0 ¢= 


is a root. In particular, setting a; = 1, we find that p. = n is only a simple 
root of det(W — pI) = 0, the other roots being all 0. Hence pp is the root of 
multiplicity mo = 1, and each of the other roots are functions only of the differ- 
ences a; — ao. If we denote the trace of the matrix V" by ns,, then 


r 
(2.8) ns, = Doni, 


and we may solve for the constants mn. We let A be the Vandermonde deter- 
minant of po, pi, -- + , px, and A, be the Vandermonde determinant of po, pi, - - - 
Pit, Pts, +++, px, noting that (— 1)'"'D'(p,) = A/A:, and obtain 


’ 


Bae 1 we 
7 Poss: pi-1 81 Mtn + ** Pr 

(2.9) —~A=! 2 2 2 2) =F A, 
n Po *+* Pr-1 82 Pt ++ Pr 
a : , 
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3 8 ° ° +1 . 

where F, is the polynomial obtained from (—1)‘"'D’(p,) by replacing each power 
h ° = 

p: of p, by the corresponding s,. Since m = 1, we have 


(2.10) Fo = A/ndo = —D'(p)/n, 


F, nA/nd, = (— 1)'"'D'(p.ni/n. 


It follows that 


PN 
N/w-! = n° TY (m/n) = (« Ir : /TI(-0" De) 


(I F.) /* = (1 F.) / Fo AG. 


The proof of Theorem I depends now only on the proof of 
» 
Lemma I. J] F, = KF(F’, where K = kike --- ky, and F is a factor of Ao 
t=0 
such that R = Ao/F is an integer. 


(2.11) 


3. We shall now prove the theorem for the cases \ = 2, = 3. To facilitate 
the algebraic work we introduce the notation: 


(3.1) 0 =po— pi; do=1, di= DO, be = D 4:0;, +--+, dn, = O02 - +H. 


(3.2) % = 1 — 1/n, 0; = 81 — Po, Oo, = > (") (—po)‘ 8+, u> 0. 
t 











Since 
1 1 «++. 1 1 its 1 | 
C1 —6; eee —A, —A; | 
(3.3) le Baw CP) © co ‘ | 
a, (—@)*---(—&) (—a)*" a 0)" | 
= (—1)*(@, + ogra + +++ + ogo) 
and 
(3.4) Fy = (—1)'¢r/n, 
we have 
r 
(3.5) > Ory. = 0. 
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The formula for F,, t > 0, becomes 


|] 1 cis 1 1 1 tee l 

0 —O, +e) —O1 Oo) —Or41 +++ —A | 
F, =| 2 2 2 2 

0 o2 Gian ces A 


0 (—6@)*---(—A4)* oa (igs) «+. (=a) | 








1 1 ee 1 1 Le 1 | 
0 ok tex <aliie wie. 308 oll 

~ 190 - «se We eS a € 
0 (—a)*.--(—@.s)"* (—t4:)"" --- (—a@)" 


= (=1) [origi + orgs + --- + a], 


where ¢{” is obtained from ¢, by setting 6, = 0, and satisfies the equation 


os" =  — 0:65°,. In view of the identity (3.5), we have 
Fy = (-—1)'[—oodrx — ora — 24) — «++ — nai — o1”)), 
and hence 
(3.6) Fe = (—1) "Or ov@ss + ordi + «++ + ora90]. 
Now consider the case \ = 2. We have 
(3.2a) o =1—1/n, 0, = —(aki + ache), of = 0% + atk; + adhe, 
(3.4a) Fo = 0,02/n, Fy = 0:(0002 + 01), Fe = —62(000: + 01), 
FoF iF: = —Fo6:2(05 0102 + o001(1 + 2) + 03) 
= —Fon(— ooo2 + oi) = Fo[(n — 1)o2 — noi] 
= Fo[(ki + ke)(aiki + agke) — (aki + ake)’, 
(3.7a) FoF: F2 = Fo(kike)(a1 — a2)” = KFOF’, 


where 


(3.8a) F = (a — ae). 














COMPONENTS OF SIMPLY TRANSITIVE PERMUTATION GROUPS 13 


To prove the lemma in this case, we show that F divides Ao . 


ao — Pp ay ae | a — Pp a ae 
D(p) = mk, Cn-— p Cr | = | mk, Cn — p Cx | 
(3.9) a ke Co Cx — p ft ee et ee 
a — a—p a, — ae 
= (po — p) = (po — p)(p — p)(p2 — p). 


A*/n*Fo = Bo = (pr — po)” = (pr + 2)” — 4pipe 
= (ao — a2 + Cu — Cr)’ — 4(a0 — a2)(Cu — Cx) 
+ 4(a, — a2)(ark; — Ci) 
= (—a2 — aie — ea + arcye + aCe)” 
+ 4(a, — a2)(arky — arene — aCe) 
= (a — as)*(eu2 — Cin)’ + 4(a1 — a)’ cr. 
(3.10) Ab = (a1 — a2)" [(en2 — em)” + 4ee2] = FPR’. 


This last bracket will be a perfect square 
(3.11) R® = (en — em)” + 4eo 


if ki + ke, since p; and pe will then be rational. Lemma I follows from (3.7a), 
(3.8a), (3.10), and (3.11), so the theorem is proved in this case. 

The algebraic work, even for the case \ = 3, is quite complicated. Using 
the same notation, we have 


a = 1—- 1/n, Oo = — (ak; + arke ao azks), 
o2 = 01 + ak; + azke + asks, Fobs + aide + o2¢1 + o3 = 0. 
(3.4b), (3.6b) Fo = — 0; 02 63/n, F, => — 0; [0 02 Os + 01 (02 + 63) + a2, ete. 


(3.2b) 


(3.7b) F\F2F3/Fo = —nlood; + 205 01¢2¢3 + op 02¢1¢3 + ooo1(b2 + o1¢s) 
7 
+ a90102(3hs + did2) + oi (Gob: — os) + ojo2(b2 + $3) + ores: + oF). 


The left hand side may be simplified, and can be shown to equal KF’, where 
F = (a2 — as)(az — a1)(C23 — Cn) + (a3 — a1)(a3 — a2)(Cu — Cre) 
+ (a, — a2)(a1 — as)(Ciz — Cis) — (a1 — a2)(a2 — as)(as — a1). 


By (2.11), we have N/n* = KF’/Aj;, or Kn "/N = A/F’. If the numbers 
k; are distinct, the roots p; --- p, must be rational, and so must Ao , the product 


(3.8b) 
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of their differences. Ao/F is therefore a rational fraction, and is independent of 
the parameters a;. Assume it reduced to lowest terms. For a. = a; = 0, 
we see that the denominator must be a factor of ¢i2 — Cys ; similarly, it must be 
a factor of C223 — C21 , and Of ¢331; — C332 , and hence a common factor of these three. 
But since we can choose a; , a2 , a3 SO that F has no factor in common with these 
three, the denominator must be 1, and Ao/F = R is a rational integer. 


4. We shall now consider an infinite family of simply transitive groups for 
which we can compute the constants n;, and shall show that our formula in 
Theorem I holds also for larger values of \ than \ = 3. We define these groups 


as follows. Under the symmetric group of degree v = \ + yu, the n = * com- 


binations of the vy symbols taken \ at a time will be permuted among themselves 
by a transitive group G*. We may denote these combinations by variables 
Zi, ig---i, » Aefined to be independent of the order of the subscripts. The sub- 
group G, leaving fixed the variable x,2..., permutes transitively among them- 
selves those variables which have exactly \ — t of the subscripts 1, 2, --- ,A, 
and ¢ of the remaining u subscripts. Hence we have r = \ + 1 sets of transi- 
tivity T7,,¢ = 0,1, 2, --- , A, which contain respectively k, = (*) (*) variables. 
The structure coefficient cq; already defined in (2.2) for the ring of the matrices 
r 
V = > aiV; is the coefficient of aaa in the inner product of the row of V cor- 
i=0 
responding to the variable 2;,2,...,, and the column corresponding to the variable 
X141,t42,---,4a- It is given explicitly by the formula 


(4.1) ew = E(,' Vy!) my Bebe | 


We verify that 


an) cn EC Co? M)Co ab 2) = (62 ade) (6) = Hm 
(4.1b) Cave = (‘) (* « ) = ba. 


Let the characteristic equation det (x QaCabt — pore) = 0 have the roots 


a 


» 
(4.2) ai = po Pita. 
t=0 


Then by solving for simple cases, and generalizing the result, we surmise that 


(4.3) pu = > (.§)7* (‘) e- ea Po = ki; po = 1. 
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To prove that this formula is correct, it is sufficient to show that 


(4.4) Zz Cabt Pit = Pia Pir, 
t=0 


since if we multiply by the corresponding p, each column of det(2 QaCabt — pow) 


and add it to the first, this first column then has in its (6 + 1)-th row the 
quantity 


> Qa Cabt Pit — Pp = > Qa Pia Pib — PPih = (p; = p) pin . 
a,t a 


Hence the determinant vanishes for p = p;. Equation (4.4) follows from the 
rather complicated identity 


t t \/r—-t ut fi\frx —i 
kaamen Ut. p> GG!) 8 Meas a e—-)(-» (oC 5 
m—?\_ pft\f(A—t\fu-—7t » avaft\fA—t\fu—?2 
. > yn (3 ame) [ams pote) (OC =4) 


for all A, uw, 7, a, b; the summation being taken over all values of the arguments 
for which the binomial coefficients are different from zero. 
In order not to interrupt the continuity, we postpone the proof of Lemma II 
until §5, and proceed to determine the constants n;. The equation (2.8), for 
r 


h = 1, is }> nip; = nao, and actually involves \ + 1 equations 


i=0 
A+1 
(4.5) 2 Ni pit = Nbo , t=0,1,---,A, 
1=0 


which can be solved uniquely for the constants n;. The determinant P = 
det(p::) has the value 


r J 
— 2j 
4.6 P = (-1)" (" ') 
(4.6) ("4 * 
and 
(4.7) % = (:) _ We J = (v+1 — 2)r!/(v+1—a)!a. 
We may now verify Theorem I for these groups. We have 


P ny 
N = JI nm =I] (@+1- ol +i - Oley 


nY 
nK = [[ nk =[] Plttha—olit@—-A-OY 
sy 1 


(= 


A 
nK/N = JT] {~4+1-0)VA-—dO!t@-A-—O!W~41- W} 
t=1 
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nN 
(4.8) 7" K/N = JT] {~?-—OVA-d)IE-DIG@-A-O!41-— Wl = PR, 
where 
[\/2] 


(4.9) R= 1 {1/(t — on | I] {(v — 2s)!/(v — 1 — 2d + 2s)! }. 


The quantity R is easily shown to be an integer, and the theorem is proved for 
this family of groups. 


5. We close with a proof of Lemma II, which in itself involves interesting 
relations between binomial coefficients. Use is made of the following identities, 
which may be readily verified. 


(5.a) E-0"(,4 x s")- pout 2). 
so E()()(2*.)- C5 9G) 


(5.7) 2 ipa Pm De eat} 


Proof of Lemma II. 
i t t A-t —t uft\frxA -—2 —1 
6D BO MMe Yes d ae JO OOM Ea) 
_ uf t\futv\futv\/rA—u—v\frA-7% 
a nO Pas as als | 
(5.2) F 
(pre (45 *)y where 0 = 1 — x, 


a uft utv+s—x—q\f—v—s+27+4+q v 
7 ef) (')( p-r mee SR 


U,V,ZY 


69 (Co tee dete MOOG 
(+5) OGasict ea 


by repeated application of (5.y) and (5.8). Now if we rearrange the factorials 
in these binomial coefficients in a different way, and set 


seirnzn= (2 VV cals Oe —o4 0) 
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we obtain for (5.3) the expression 


u PH MPhcaetl t—wu 
z= cn(,! u et a 
(5.4) °“"" 

(51) anna 


a _ ptq-rtste—z § a v—Z a 7 
m,,| p+q—r—a—b+s+y/\s+v—-2 
ey 


(622) )rmnnen 
- ayy ~q+a he +v—2— ,) 


()(=) (anna 
on = > Kv (')(\ioratety—a-bsrazy, 
Fae ht gaining Mereeanee 
Cr a 
6 = Ev (MNO - VO =e) a) 
( 


im = EI NEM DEM DO=G= 0) 


which proves the lemma. 


Brown UNIVERSITY. 











STRESSES IN MODERATELY THICK RECTANGULAR PLATES 


R. Arcuie Hiapon anp D. L. Hoiu 


1. Introduction 


In 1922 G. D. Birkhoff' suggested a method for solving plate problems which 
involves the representation of the displacements by power series. C. A. Garab- 
edian’ and H. W. Sibert® used this idea in developing methods for solving prob- 
lems in moderately thick circular plates. Garabedian‘ has also published some 
results for uniformly loaded rectangular plates. 

The authors give a solution for the displacements in an elastic isotropic 
moderately thick rectangular plate under the action of any given load which 
can be expressed as a polynomial in x, y continuous over the entire plate and 
with prescribed boundary conditions at the edges. The method, similar to 
that used by Sibert’ for circular plates, is based on the assumption that the 
components of displacement can be developed in positive integral powers of z. 
In this type of problem, the displacements must satisfy (a) the stress equations 
of equilibrium throughout the plate, (b) the surface traction conditions on the 
upper and lower faces, (c) the boundary conditions at the edges. 


2. General theory 


a. Form of the displacements. The displacements, u, v, w, are given by 


n 


oo n « n 
<s = Z 
(1) u= U. — v= V.— w= W,-—, ° 
p> nl? > nt’ >» n!’ 
where U,, V,, and W,, are continuous and continuously differentiable functions 
* oye : . 6 
of x,y. The equations of equilibrium are (A. E. H. Love,’ p. 134) 


0 0 24 2 

(2) (A + u) (2. ay’ 2) A+ uv (u, v, w) = 0. 

Received July 15, 1936; presented to the American Mathematical Society, April 11, 1936. 

1G. D. Birkhoff, Circular plates of variable thickness, Phil. Mag., vol. 43 (1922), pp. 953- 
962. 

2C. A. Garabedian, Circular plates of constant or variable thickness, Trans. Amer. Math. 
Soc., vol. 25 (1923), pp. 343-298 

3H. W. Sibert, Moderately thick circular plates with plane faces, Trans. Amer. Math. Soc., 
vol. 33 (1931), pp. 329-369. 

4C. A. Garabedian, Comptes Rendus, Paris, vols. 178 (1924), 180 (1925), 181 (1925), 
186 (1928), 195 (1932). 

5 Loe. cit. 

6 In this paper all references to Love are to the fourth edition of his Mathematical Theory 
of Elasticity, 1927. 
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When equations (1) are substituted in (2) and the coefficients of like powers of 
z are equated, there results’ 





— —1 277 Uns aVn-2 re] 
— ik — 2 Ws + Ga + Gey tee | 


—1 aVnw 
3.3)° W,= Ja —2 vw. =| 
(3.3) x1 —s) ( a) ~ 
By successive applications of the recurrence relations (3), U,, V,, and Wa 
are expressed in terms of Up , Vo, Wo, U1, Vi, and W,. The results are 








(4.1) Un = (-1)'v bi Ur+k I, 
(4.3) Wey = (—1)‘v™ ja — 26 — k)W, — (2 + vm), 
Ox oy 
(4.4) Uns = (-17'9"" | Ui+ pe, 
as k o2k—-2 au, , avi 
(46) We =(-1)'v t _ 24 26)Wy + (w: F 7“), 
where 
(1 — 20)W, = e+ tM, 
(1 — «) We = 98 4 a VW, (k=0,1,2,3, -+-). 


When formulas (4) are substituted in (1) there results 


id —_ aw, 2k 
w= (-1)'V peg = le 


(5.1) om 2k+1 
ktk-2 r cin 
= — k o2k—2 - = au, , avi * 
on % (vives |@— 242 s+ (Bs *] iy! 
; ke@w2k . aU aVo Pane 
+X ute[a — 26 — wm, — (22 4 0 


These expressions for the displacements satisfy formally the stress equations 
of equilibrium throughout the body. 


7 Here and henceforward V? = bd + a . 
6x? = Oy” 

8 (3.2) is omitted because it can be obtained from (3.1) by interchanging z and y, U and 
V, ete. Throughout this paper an equation will be omitted when it can be obtained from 
the preceding equation in this manner. 
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b. Surface traction equations. A right-handed coérdinate system with the 
faces of the plate z = +h, x = 0,2 = a, y = 0, and y = b will be used. The 
x, y, and z components of the surface tractions will be designated by li, J1, 
and P, respectively on the upper face and by Lz , J2 , and P: respectively on the 
lower face. Using the notation of Love (p. 77), the surface traction conditions 
on the upper and lower faces may be written as 


(Xz) z= ” lh, (X2)2—-r = Iz, 
(6) (Y 2) enn = J; ’ (¥ Jeo = Jo, 
(Zz) 2mh = P,, (Z 2) 2——h = P, ° 


The stresses in terms of the displacements are (Love, p. 101) 
2u ow du . dv 
Z.= 10. a> ~~ 4 
1 — 2e ja °) az - 0 (2% i =) 
ou ow dv . dw 
ie == = ~~ ED Y, -_ 
(S+5) H(i + 5): 


Substitute equations (5) in (7) and the results in (6). Then take the sum and 
difference of the two resulting values of Z,, X., and Y,. The final equations 





(7) 























are 
= keo2k-2| @2y7 hk au, av, 
(s) 2, (- vv | vu: (3 er) 
_tsitey(% me h _In+T1, 
l-—o (2k)! Qe ’ 
= k wy2k k—l+ov re — hata (48 vi) - 
tian 2 (-v'v a ve 2(1 — a) ei 
— P, 
7 me, 
. kt+1 ok | 62 na aU, <7) 
(11) 2 (-1) . I —_ +7 (Se + azay 
4 Mk + @) | n* _h—Il 
1—2c ax |(2k+1)! yh ’ 








< k+1 2k bas (20 ve) k-—l+eo ‘lan Pi+ Po 

a3) 2 (-yv = me *e)t 1-e ae 
Equations (8), (9) and (10) involve only Wy, U; and V, and equations (11), 
(12) and (13) involve only Uy), Vo and W,. These two simultaneous systems 
of equations can be solved by an indirect process due to Sibert.’ This process 


® Loc. cit. 
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requires that Uy), U1, Vo, Vi, Wo and W, be expressed as infinite sequences 
of terms of ascending order of magnitude. Let s represent a first degree func- 
tion of x, y. Order of magnitude may then be defined as follows: If r and t 
are two functions of s which contain the same number of terms, ¢ is defined to 
be of the n-th order of magnitude as compared to r if each term of ¢ is propor- 
tional to (h/s)" times the corresponding term in r. It then follows that h°"V°"r 
is of the 2n-th order of magnitude as compared to r. It is necessary to assume 
that Uy, Vo, Wo, Ur, Vi, and W, are expressions in z, y which involve h in 
such a manner that their terms can be grouped and arranged in ascending order 
of magnitude. 

Since equations (8) to (13) inclusive have been arranged so that only even 
powers of h occur in their left members, it is only necessary to provide for even 
orders of magnitude. Therefore 


(14.1) Uy. = py Ueno, U,= yi Vena, 
(14.3) Wy, = y Won, Wi = ) Won,iy 
n=0 n=0 


where Wano, Ueno, Vono, Wena, Uena, and Ve, are of the 2n-th order of 
magnitude as compared to Wo, Uo, Voo, Wa, Un , and Vm respectively. It is 
assumed that Wo, ---, Vo, being the terms of lowest order of magnitude, 
do not vanish identically unless Wo , --- , Vi respectively are identically zero. 

For simplicity the problem will now be restricted to the case of a normal 
surface load only. The solution for the case of a shearing load is very similar 
to this case. By superposing these solutions, the results for more complicated 
problems can be obtained. 


c. General solutions for W,, l',, and V,. In order to solve equations (8), 
(9), and (10) simultaneously it is necessary to write each one as an infinite system 
of equations by equating terms of the same order of magnitude. The right 
members of equations (8) and (9) are now zero, but the order of magnitude of 
the right member of (10) must be determined. Assume it to be of the same 
order of magnitude as VWo. Let Pi — Pe = —p; , where p; is a function of 
x, y. Then the equations of lowest order of magnitude in (8), (9), and (10) 
may be written 


(8.0) Un + ve = 0, 


da , Vu _ 

ax ay Quh- 

Un 5 OVe  o. This result is in- 
Ox Ce) 


consistent with (10.0). Therefore p,/(2uh) must be of the fourth or higher order 





(10.0) v We + 


However gives V° Woo + 








a(8.0) , a(9.0) 
Ox + oy 
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of magnitude as compared to Wo. It will be assumed now and proved later 
that p:/(2uh) is of the fourth order of magnitude as compared with Woo. Equa- 
tions (8), (9), and (10) may now be written as infinite systems as follows: 


= 00 


(8.0) Un + = 0 





? 





, * Qn, 2 27,7 k al n—2 
l 2n,1 + + > (— 1) Vv is bs U 2n—2k,1 fb pee (= ae 


Ox" 
ia *y k W. n* 
O Ven—2x,1 —l+o OW 2n—2k,0 v 
ent RTE Sg quem ———— | = ] 3 eee) 
+ axdy ). l-—@o € ax a) (2k)! eG 1% 3, -++) 
( 
Since oe) + oe = (10.0), it is necessary to form another system of 





€ 
equations by subtracting °69) + oe.) from (10.n). The resulting system 
Y 
> 10 





1s 
(15.0) 0=0, 
2} / avn aVn —2 2p1 
_ nth shah .g TN =f 
(15.1) Vv 2 o)( - + - ) ov We | D? 
> (—1)*8'y? ‘La tae (ee 5 ae 2n—2k, Venu) 
(15.n) ma a ekn?t? 
. _— 
_ (k _ 1 a o)V'W ran (2k + 1)! = 0 (n = 2, 3, 4, ee *). 


Systems (8), (9), and (15) can now be solved simultaneously for Wen, Vana 


a(8.0) 4 a(9.0) Bice 
Ox oy 
2 aUm , Va 247 | _ 2bopr 
V | 2 — «(20 + van: D”’ 


2 au 01 oo i 2Qdopr 


respectively, where bb) = 1 and d) = 0. The simultaneous solution is 


and Ven,. Equations (15.1) and the Laplacian of 




















8Um , AVan\ _ 
= ra) = pied + bol. 





V'Wo = = ple — a)dy — bol, v( 


10 = OO SSO, 
3(1 — o) 3(1 — o?) 
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a(8.1) , a(9.1) 











Equations (15.2) and the Laplacian of ———~ + ——— become 
ox oy 
2 aU °F) — Qh bi Vp; 
Vi (2 — ——— —}—oV Wa} = —— 
(2 o)( ax a ay o J 2 D ’ 
° aU OVa a 2h dv" Pr 
Vv - VWa | = —.—, 
isha hale fal 
. 1 | a : rig 
respectively, where b} = = and d; = — —. The simultaneous solution is 
5 2(1 — a) 
i. » fale av. Ms 
vi 2 >= D V pi l(2 _ o)dy; _ bil, v( — a vr) = D V piled, oa b). 


Continuing in this manner, by solving (15.n+ 1) with the Laplacian of 
a(8.n a(9.n 
( dy (9.n) 

oy 


F , one obtains the following general solutions: 
x 











(16) V' Won = a Vv" m[(2 — o)d, — byl, 

(17) v° (“as + V1) = a V" plod, + bal, 

where 

8) b= } (1 8+ DUG + ane Ee I) = ode rd 

(19) d= : (-1)' © fe (n = 1, 2,3, ---) 


Sibert has given upper bounds of the sequences b, and d,, as 


san 1 , 1 

Kies. tat~es = 23,..-). 

Hal S § 1@=l < 3141 — @) in 
Since (16.0) is T'Wo = +. it is the differential equation defining the vertical 


displacement in thin plate theory (Love, p. 488), and Wo is the vertical dis- 
placement of the corresponding thin plate. Consequently the vertical displace- 
ment Wp» of the middle surface of a moderately thick plate is made up of the 
corresponding thin plate displacement Woo plus corrections Wa , W4 , ete. 

By combining equations (16), (17), and (8), Us,, is obtained in terms of 
We,o. Similarly (16), (17), and (9) yield V2,,. The results are 
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(20.0) a 4 * 0 0, 
Ven, 1 a — 10 + - : i (2 - 
. a Ox 
(20.n) 
on we? “wo — ss 
+ h"v" (s) 43 (2 — Sine ) =] =0 (n=1,2,3, --:). 
— ¢ 


al "on,1 on 





If We,» is eliminated from equations (20) and (21) the relation 


2n 


Vena « , pn ; ’ ’ ee 
—— is obtained. This relation combined with (17) gives 


74 a he 2n api 
(22) Vv Uena = D [od, + b1V (+). 


It is necessary to complete the proof that p:/(2uh) is of the fourth order of 
magnitude as compared to Wo. It has already been shown to be either of the 
fourth or of a higher order of magnitude. Assume that its order of magnitude 
as compared to Wo is greater than the fourth. Then equations (15.1) and the 
a(8.0) 4 a(9.0) stieiieie 

Ox oy 


2t /, aa =) 2 o 
v[ 2 — 0) (Us + Fe) _ ovr] = 1 
vy | (ee + Va) f. We = 0, 


respectively. The simultaneous solution is 


4 aa os) - 
Vv Wo = (Ue ai) = 0. 


Laplacian of 











Since the trivial solution V‘We,,0 = 9 does not depend upon the load, p; must 
occur on the right side of some one of equations (16). But since (16.0) is the 
equation of lowest order of magnitude in the system (16), its right member 
cannot be zero unless the right members of all equations of the system are zero. 
From this contradiction it follows that the term p;/(2uh) must be of the fourth 
order of magnitude as compared to Wo . 

Finally, equations (16), (17), (22), and (23) substituted in (14) give the fol- 
lowing general solutions for Wy , U; , and V; : 


(24) VW = > *v 2". [(2 — ada — Dal, 
(25) VU; = > hoy (2!) dy, + dal, 
n=0 D 
‘ au, avi _ = h*" 2n 
(27) a(% + . v1) = > ) Vv piled, a b,]. 
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In 1899 J. H. Michell” published the differential equation defining W» in 
the form 


4yyr ton e 2 ) | l+o 2 OZ. 
iain (0 + 33) Ge lie EY \be Jone’ 


It is easily shown that this solution becomes identical with equation (24) when 
the stress Z, is expressed in terms of the load. 





d. General solutions for W,, Up, and Vo. Let P; + P2 = —pe, where pe 
is a function of z, y. It can be proved that the right member of (13) is of the 
same order of magnitude as Wy. Then equations (11), (12), and (13) can be 
written as infinite systems of equations. These systems can be solved in essen- 
tially the same manner as that used to obtain the simultaneous solution of sys- 
tems (8), (9) and (15). The general solutions for U» , Vo and W, are 











bed 2n 
(28) vw.=--> 7 Vv"? [(1 — aden + oan], 
n=0 
a A” onie/8 
40 = mee Ope <—_ 
(29) ViUo 2. an Vv (2) fc, + (1 — o)a,], 
2(0Uo , AVo\ _ <a he” ents 
(31) V (es + ors) = > _* prlocn + (1 — o)an), 
where a = 0, @ = 1, 
e-1i . . \ 
(32) e, = p> (1) (i + 2)an—1-i — (i + 1)en—1- | 
i=0 (27 + 3)! | 
n-1 (+1) }(n = 1,2, ---). 
aie _ yy + Dani — Uni 
Sibert has given as upper bounds for these sequences 
lal< 3, lel < gam (n = 1,2,3---). 


The displacements u, v, w are given by relations (5) when the six functions 
Uo, Vo, Wo, U1, Vi, and W,; are known. Therefore one can say that the differ- 
ential equations (24) to (31) inclusive define the displacements. Furthermore, 
the displacements defined by these differential equations satisfy the equilibrium 
equations and the surface traction conditions for any normal load which can 
be expressed as a polynomial in zx, y continuous over the entire plate. It re- 
mains to solve these differential equations subject to particular sets of edge 
conditions. 


11 J. H. Michell, On the direct determination of stress in an elastic solid with application 
to the theory of plates, Proc. Lond. Math. Soc., vol. 31 (1899), pp. 100-124. 
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3. Normal load, pi(x, y) = po(x,y) = P(r i hs = + *) 


1. Preliminary relations and remarks. The displacements will now be found 
for a normal load Ppat® + — = + *¥) on the top surface of the plate where a and b 


are the horizontal dimensions of the plate, A, a@ and 6 are arbitrary constants, 
and P is a uniform load per unit of area. In this case equations (24)-(31) 
become 


‘ iw . —P ax , By 
(34) v W o = “D ( oe my + BL), 

- 4p, Pa 
i V1 = aD’ 
‘ 9 al 7 av; P ar or) 
on v (3+ F)- B+ s+), 
(38) VW, = 0, 
(39) VU, = 0, 

2 al ly aV,y 
v ——) = @ 

- & " =) 


The load is a linear function of x, y and it is of the fourth order of magnitude 
as compared with Wo). Hence all terms whose orders of magnitude are 2 6 
vanish. Therefore all infinite systems and sequences of the previous section 
become finite. The following results are readily deduced. 


, _ _ OW h* _.faWs h'(5 — 20) (2%) 

9 j = —— - . a i ‘Seuhimandtn 
(42) we an ~.0(%) 6(1 — o)* ai ax J’ 
eee =v al "0 aVo Pa _ 2c) ar #) 
(44) m= = (% + y) ee (ee , 


_ pis au aVo - 2 

w= Uet Ue —[0U rt 42 (@+ Vs 4 wi) |S 
2—c-(aW.\2 . (3—2c)h? .faW\ 2 3—c4faWo.\2 
,haa’ (on + 3a —eF” (= ‘5 he er (Fe) 5, 


CW +h! ow, - tte, 2 
< o td °* Mi = oF ae at 


(45) 





(47) w= Wo+ Wiz + 


The problems of plane and generalized plane stress are very easily solved 
by use of equations (34) to (47) inclusive. For these problems P = 0, and 
since Wo; , 0Uo/dx and aV_/day are of the same order of magnitude as P, they 
may be chosen equal to zero. With these simplifications, the values of the stresses 
obtained from equations (45), (46) and (47) agree with those given by Love 
(pp. 467-471). 
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The problem of this investigation is restricted to moderately thick plates 
because the tractions applied to the edges are represented by their force- and 
couple-resultants taken along a vertical element of an edge. The boundary 
conditions at an edge will be defined in terms of the components of these re- 
sultants; namely, 7, S, N, G, and H (Love, p. 455). At a pinned edge 7’, G, and 
Wo vanish. At a free edge 7, G, N — @H/@s vanish, s denoting the direction 
along the edge. Ata clamped edge Uy , Vo , Wo , and @Wo/dn vanish, n denoting 
the direction of the normal to the edge line. These components must be ex- 
pressed in terms of the displacements before they can be used here. The results 





are 
T, = fo | he aVo Fat = +) 
l1—ocl dz oy 4u a b 
at h l l Vo\\ 
—h } 2 Ol o a Alo 0 o 
Hoar (2) + £8) 
err —4yh* aw, aw, 8 — 3e Penh a7 8+ 06 2 (=) 
(50) Gr = sen Se + ore + pore ay Me — Ea) | 
y, — 22 _ —4uh? « oe Wo , a Wo 
_ - wa=2 array ' ay’ 
(52) 


s a o 2 0 Wo (8 _ 30)h" 4 ry] 
si + atve( he) 4 Ga Bek ay ) | 


Further, each equation which results from equating quantities (48) to (52) to 
zero must be written as a system of equations by equating terms of like orders of 
magnitude. The equation of lowest order of magnitude in each case will be 
referred to as (48.0), (49.0), --- , (52.0); the next order will be referred to as 
(48.1), (49.1), --- , (52.1), ete. 

The method of solution used here does not give Wo directly. First Wo is 
obtained, then Wo , etc., and finally Wo is obtained from equation (14). It has 
already been pointed out that Woo is the vertical displacement for the correspond- 
ing thin plate under the same normal surface load. By observing that condi- 
tions (50.0), (51.0) and (52.0) are precisely the corresponding edge conditions 
from thin plate theory, one may employ the Wo solution of the thin plate prob- 
lem in case it is available. The thin plate solution for the particular load of this 
problem has not been recorded and it will now be obtained for three sets of edge 
conditions. 


b. Solution for Wo. The following Fourier expansions which are valid in 
the interval 0 < x < a will be needed. 





nN 2, _ 4A sin@x 2 = oe 
(58) ges ee ae 
. Qa, es 3) 2a nei sinér 2 — .sin Ox 
(54) ga @t-H= 72h (-D) - a? ee ’ 
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oo a 8<¢ ~ Sin 6x 
. + m= ae 
(55) 2s: »» (i + j) — 
= ra’ (/x* 2x° x 2 — .sin 62 
56 eal dl he ne 
(56) (3 ete a! eae ’ 
4 5 3 ) 
2x 7x 2 . sin 6x 

57 <(é _ 2x’ re ) _ 2S ; sin or 
(57) 24 \5a® = 33a + 15a ad ge ’ 
where 

nr ‘ n . n+1 n 
ale. i = A[1 — (—1)"I, j = a(-1)"", m = B[1 — (—1)"]. 


In the problems to be considered, the edges x = 0, a will always be pinned. 
A solution satisfying edge conditions Wo = 0 and (50.0) at x = 0, a and satis- 
fying the differential equation (16.0) or (34.0) is given by 


ee — Pa‘ es) (Z 22° ‘) (= 2x° 7x )] 
Woo = 4D [+4 at ae + a) + *\Sat ~ Bait 6a 
—Pwo sin 6x 
wD 9° 





(58) 





[A, shéy + B, chéy + C,6y shéy + D,6y ch by], 
where the constants A, , B, , C, and D, are to be determined by the conditions 
at y = 0, b. 


Case I. Theedgesy = 0,bare pinned. The boundary conditions are Wo = 0 
and (51.0). When (58) is substituted in these edge conditions and use is made 
of the appropriate Fourier expansions, the following system of equations is 
obtained. 


B, = —2(¢ + 9), 
A,sh¢d + B,ch¢ + C,gsh¢ + Didche = —2(¢ +7 + m), 
B,(1 — o) + 2C, = 2e(i + J), 
A,(1 — o)sh¢ + B01 — «) ch¢ + C,[o(1 — co) shod + 2ch¢) 
+ D,[o(1 — ¢) cho + 2sh¢] = 20(i +7 + m), 


where ¢ = 6b = nrb/a. The simultaneous solution of this system of equations 


1s 


(¢ + j)(1 — ch¢)(2sh¢ — $) + m(2sh¢ + ocho) 
a sh? @ : 





& 
| 


- — 2a + Dp; 
(i + j), 


“~ 
-2 

2 
ll 


p, = &+I0.— che) + m 
. sh @ . 
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These values substituted in (58) constitute the solution for Wo for y = 0, b 
pinned. When d = 1, this solution reduces to that published by 8S. Iguchi.” 
Case II. The edges y = 0, b are free. The boundary conditions are (51.0) 
and (52.0). The substitution of equations (53), (54) and (58) in these edge 
conditions yields 
B,(1 — o) + 2C, = 2o(t + Jj), 
A,(1 — c)sh¢ + B,(1 — a) cho + C,[2ch¢ + (1 — o)dsh¢] 
+ D,[2sh¢ + (1 — a)¢ch¢] = 2o(t + 7 + m), 
A,o(1 — o) — Diol + ¢) = —2m(2 — o), 
A,g(1 — «) cho + B.@(1 — o) she + C.o[(1 — o)p che — (1 + a) sh] 
+ D,¢[(1 — o)¢sh@ — (1 + «) ch¢] = —2m(2 — @). 


The simultaneous solution of this system of equations is 


2o(1 + o)(i + j)(1 — chg) 





























ee << oy 
2m[o(1 + o)o(R — S chg) — (2 — o)(R — S)(2sh¢@ + Sy] 
+ x SiR? — 82) 

“a 2e(t + JIC + ¢) shod — - S)__ 4m|ooS sh @ + (2 — «)(1 — ch@)(R — S)] 
so (1 — «)(R — S) S(R? — S?) 

ios 2o(i + j) sh¢@ 2m[ooS sh @ + (2 — o)(1 — ch o)(R — S)] 
sane R-S o(R? — S*) 

os 2o(i + j)(1 — ch) , 2ml[(2 — o)(R — S) sho + op(R — Sch ¢)) 
R-S = $(R? — S?) 


where R = (3 + c) sh¢@ and S = (1 — a)d. These values substituted in (58) 
constitute the solution for Wo for y = 0, bfree. If\ = 1 anda = 8 = 0, this 
solution reduces to that published by D. L. Holl.” 

Case III. The edges y = 0, b are clamped. The boundary conditions are 
Wo = @Wo/day = 0. Equation (58) and necessary Fourier expansions com- 
bined with these edge conditions yield 


B, = —2(i + 9), 

A, sh¢ + B,ch¢ + C,¢sh¢ + D.oche = —2(i + j + m), 

Aid + Did = —2m, 

A,och¢ + B,dshd + Crd@che + sh¢) + D,o(sho + ch¢) = —2m. 


128. Iguchi, Eine Lésung fiir die Berechnung der biegsamen rechteckigen Platten, Julius 
Springer, Berlin, 1933. 

13 —). L. Holl, The deflection of an isotropic rectangular plate under the action of continuous 
and concentrated loads when supported at two opposite edges, Iowa State College Journal of 
Science, vol. 9 (1935), pp. 597-607. 
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The solution of this system is 


4, = — MALI — ho) _ Indll — ch) 
— o+sh¢ ¢—sh?o ’ 
B, = —2(% + dD; 
— 2(i + j)shd | 2m[¢ she + (1 — ch¢)( + sh¢)] 
: @+sho o(¢? — sh? ¢) 
~—s 2(7 +3) Ma ch 9) 2m(sh° o-—¢ ch ¢) 
ss @+sh¢ ¢(¢* — sh? ¢) 


These values substituted in equation (58) constitute the solution for Wo for 
y = 0,bclamped. If \ = 1 anda = 8 = 0, this solution reduces to that given 
by Holl.” 


c. Solution for Wo. With Wo known it is now possible to solve for Wo. 
Since W2 is a bi-harmonic function of the second order of magnitude as com- 
pared with Wy , a solution of the following form is assumed. 


Wp = = | (x + ee) (x” — ar) +3 ead (a = ‘2)| 


(59) ; 
_ 2Ph 7 bx 


[F,, chéy + G, shéy + H,, dy chéy + I,6y sh 6y). 


— (8 — 3c) 
mia“ 
H,,, and I, will now be determined by the conditions at y = 0, b for all three 
cases. 

Case I. The edges y = 0, b are pinned. The edge conditions are Wa = 0 
and (51.1). The results are 


Pinned edge condition (50.1) at x = 0, a requires that A = 








oe — 3c) 

F, = - g) itd) 

G.= ia 30) LG i+ ij) — ch ¢) + m|] 
a * =a sho ’ 
H, = I, = 0. 


When these values are substituted in equation (59), the solution for Wo , for 
y = 0, b pinned, results. It is easily shown that W2 may be written in the form 


(8 — 3c)h? 


(60) Wa — - 1001 — «) 


VW. 


14 Loc. cit. 











> ss 
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Case Il. The edges y = 0, b are free. The edge conditions are (51.1) and 
(52.1). The results are 





» _ —8 —(8 + a) 1 8m(1 — ch) 
"oe" ga — a) + 5+ 
i aa +2) ' 4m(2sh b+ S) 
ale 1001 — zt j Ant 5S(R +S) ’ 
—— 1p. 4m sho 


5" 5¢(R +S)’ 
1, =1¢, — 4m(l= che) 
"5°" 5e(R + 8)’ 


where A,, B,, C, and D, are the constants determined for Wo for Case II. 
Wx» may be written in the form 





- _ —(8+ OP ian. Qh° a “Woo 4Ph° sin 6x 
We = Soq ey ° Vo- = ae + eb & 


i my 2m 
(61) E i+jg+ — + dR +5) {—2(1 — ch@) chéy — (2sh¢ + S) sh by 


+ (1 — o)éy sh ch by + (1 — o) by(1 — ch) shy} |. 


Case III. The edges y = 0, b are clamped. The edge conditions are Wx = 


OW = 0. The results are 





dy 
8 — 3c 8 .— oy 
F,= — — B, in = a nr Cn» 
20(1 — oc) y G 20(1 — a) 
8 — 3c 8 — 3c 
H,, = —_— Da, 2, = SS 
20(1 — oa) . 20(1 — o) 


By use of these values solution (59) may be written as 


—(8 — - 3o)he a WwW 00 


(62) "a= ‘100 —o) ax * 


When A = landa = 6 = 0, all three of these solutions for We reduce to those 
given by Garabedian.” 


d. Solution for We, (m 2 2). Now it is possible to solve for Ww. Since 
Ww is a harmonic function of the fourth order of magnitude as compared with 
Wo , a solution of the following form is assumed. 


‘ A By Ph* sin i 
(63) Wa = a+ po er) + 


D [J, chéy + K, sh 6y]. 


15 Footnote 4, vol. 195. 
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The boundary conditions at + = 0, a are Wy = 0 and (56.2) which reduces to 
Ww 
ax? 

determined by the conditions at y = 0, b for each of the three cases. 

CaseI. The boundary conditions at y = 0, b are Ws = 0 and (57.2). These 
conditions require that J, = K, = 0. Therefore Wo = 0. By inspection of 
equations (16) it readily follows that Wen = 0 (n = 2). 

Case II. The free edge conditions are (57.2) and (58.2). They require that 


_ 28 +4) _ 28 +0) 
“Fite | Ke FG) 


With these values solution (63) may be written 
wf Sb | Ww 8 — 30 ‘( ar au) 
(64) vee ee VWs - ais" > =] +> “ 


By a consideration of orders of magnitude or by (16) it is evident that Wa.o = 
0 (n > 2). 


Case III. The clamped edge conditions are Wy = 


= 0. These conditions require that A = 0. J, and K, will now be 





Jn 


aw. 
* = 0. These con- 





ditions are satisfied when J, = K, = 0. Hence Wy = 0. As before, it follows 
that Weno = 0 (nm 2 2). 

All of these solutions for Wy reduce to those given by Garabedian” for the 
case\ = landa = B= 0. 


e. Solution for 1) and V,). It remains to determine Uy) and Vp to complete 
the problem. It has been pointed out that Wm and a“ + oe are of the same 
order of magnitude as the load which is of the fourth order of magnitude as 
compared with Wo. Therefore 


aUm , aVo0_ 
= *G “*+*t 





+2. =e [F, ch 6y + G, sh @y + H,,6y ch by + I,,6y sh by). 
However equation (41) requires that this expression be a harmonic function. 
Therefore H, = J, = 0.. With this background one may consider the following 
forms of solutions. 


Uo =a +ar+eay+or +ary+oy + >, en [A, ch 6y + B, sh 6y], 





sin 6x 


Veo = ko + kix + key + kx? + kaxy + ksy? + >> ab [C, ch 6y + D, sh by). 





16 See footnote 15. 
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It is obvious from the definitions of a pinned edge and a free edge as given 
in this paper that T = 0 is the only restriction on U» and Vo for Cases I and II. 
Since this condition is not sufficient to determine all the constants in the above 
general forms, the solutions obtained here must be recognized as satisfactory 
solutions and not necessarily the unique solutions. Garabedian” avoided this 
difficulty in his solution for a uniform load by assuming that Uoo and Voo were 
closed linear functions of z, y. However, this assumption does not permit a 
satisfactory solution in the pinned-clamped case as it requires that 7’, be different 
from zero at the pinned edges x = 0, a. Love (p. 462) adds the condition 
S = 0, but this condition does not yield a reasonable solution. Furthermore 
S. Woinowsky-Krieger™ has shown that S is different from zero in general. 

Equation (48.0) is the only condition on Uo and Vo at z = 0, a. It gives 


AP P P 
a+ ake = Fo, c+ Qoks = Fe, 2es + oky = Fo. 
Therefore choose 
O=aQ=—-Gg=~k=—-kh =k = 0; 
-_— & a _. a 
C1 = he = ei Cs = 2ks = orp 3 2cs = ks = oo 


where the quantity E is Young’s modulus. .Equation (49.0) at y = 0, b yields 
D, = cA, and C, = ocB,. If the plate be fixed in space by making Voo = 0 
at y = Oand Uw = Oat x = 0, then B, = A, = 0. Consequently D, = C, = 0 
and the final solution may be written 


a ax , By 
(65) Ue= Se(a+ +), 
rg = BE, 


Since Ue and V2 are of the second order of magnitude as compared to Uo and 
Vo , they must either be zero or constants. But Voo = 0 at y = 0 and Us» = 0 
at x = 0 to fix the plate in space. Therefore they are zero everywhere. Conse- 
quently Ueno = Vono = 0(n 21). This means that (65) and (66) give the 
complete horizontal displacement of the middle surface. Although not a unique 
solution it is a rational one in the sense“ that it yields a displacement at the 
middle surface which is one-half of what the displacement would be if the plate 
were resting on a complete foundation. If \ = 1 and a = 8 = 0, these solutions 
reduce to those published by Garabedian” for a uniform load. 


17 See footnote 15. 

18 §. Woinowsky-Krieger, Der Spannungszustand in dicken elastischen Platten, Ingenieur- 
Archiv, vol. 4 (1933), pp. 203-226, 305-331. 

19 See footnote 15. 
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The values of Uo and Vo for the pinned-clamped case are most easily ob- 
tained by imposing the conditions Us = Vo = 0 at y = 0, b first and then 
requiring that 7; = Oatz =0,a. The final results are 








a Fl ar —§ (A + a) — FY 4 ae ue 
2E 6 
i 2 G+ 9)0 h @¢) 
cos 6x + J —ch¢d) +m 
) a C + j) chéy + ° =< snoy}|, 
: oP <3 > . sin 6x |2(¢ + j) +m | 
(68) Vo = oR | y + — + > 5 (M+ Dt @ sh by? |. 


It readily follows that Ueno = Varo = 0 (n 2 1). 

Woinowsky-Krieger’s” solution for an infinite plate strip pinned at the edges 
x = 0, ais readily obtained from any one of the three cases solved here. It is 
only necessary to set Vo = 0 and redetermine Uy to satisfy the conditions at the 
edges x = 0,a. Then in the limit as b — ~ the solutions given here reduce to 
those Woinowsky-Krieger gives for the infinite plate strip. 


Iowa State COLLEGE. 


20 Loc. cit. 
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ON MATRICES OF INTEGERS AND COMBINATORIAL TOPOLOGY 
By Hasster WHITNEY 


1. Introduction. Our object is to give an elementary account of some alge- 
braic theorems, with some immediate applications in combinatorial topology, 
in particular, in the theory of homology and cohomology’ groups. The theo- 
rems are to a certain extent known, if in somewhat different forms. 

The main tool in the algebraic part is the theory of group pairs, and in par- 
ticular the question of when one group “resolves” or “completely resolves” 
another. The main theorems are on the existence of extensions of a homo- 
morphism, and the existence of solutions of linear equations, with a matrix of 
integers and elements of an abelian group as unknowns. In each theorem, two 
types of conditions are employed, one using mod m properties, the other using 
group pairs. We shall use only discrete groups (except in Theorem 1). 

The applications to topology are concerned with the relations between the 
ordinary homology theory, and the newer “dual” theory. An illustration of 
the convenience of the newer theory is given in Appendix I. For other illus- 
trations, we mention the duality theorems (Kolmogoroff, Alexander, Cech, etc.), 
and properties of maps (see a following paper). 


I. Group pairs and homomorphisms 


2. Group-pairs. All groups will be abelian. 0 will denote the identity in 
any group. Let 


mG = all elements mg, g in G (m an integer), 
m@ = all gin G such that mg = 0. 


Then 0G contains 0 alone. G — G’ is the difference (factor) group of G over 
the subgroup G’. 

Let G, H, Z be three groups. If to each g in G and h in H there corresponds a 
z = ghin Z, and both distributive laws are satisfied, we say G and H form a group 


Received December 1, 1936; presented to the American Mathematical Society Decem- 
ber, 1936. 

1 The relation of “‘coboundary”’ is becoming of increasing importance. Other terms 
have been used: dual boundary, upper boundary, inverse boundary, boundary in the dual 
subdivision, derived (of a function). The present term (accepted by E. Cech) offers 
definite advantages. In differential geometry, the (exact alternating) covariant tensors 
and the contravariant differentials play the same réle as cocycles and (contra)cycles in the 
combinatorial theory; in fact, they may be obtained directly by a passage to the limit. 
Moreover, the prefix co is very convenient to handle. 
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pair with respect to Z. If gh = 0, we say g and h are orthogonal. For any 
subgroup H’ of H, set 


(G, H’) = nullifier’ of H’ in G = all elements g in G orthogonal to every h in H’. 


Define (H, G’) similarly. 

If (G, H’) = 0, we say H’ resolves G.’ If (G, »H’) = mG, we say H’ m-resolves 
G.* Clearly “H’ 0-resolves G” and “‘H’ resolves G” are equivalent statements. 
If H’ m-resolves G for all integers m = 0, we say H’ resolves G completely. Note 
that 


(2.1) if H’ resolves G, then (G, mH’) = »G (all m). 


(For m = 1, this is the definition.) For if mg # 0, choose h in H’ so that 
(mg)h # 0; then g(mh) ¥ 0. 

Let G and Z be groups. <A Z-character of G is a homomorphism of @ into 
(part of) Z.° Let Chz(G) be the group of Z-characters of G. Given g in G 
and ¢ in Chz(G), set g-¢ = $(g); then these groups form a pair. Clearly any 
group resolves any of its character groups. The converse is false. 

G is infinitely divisible if mG = G for each m. 


3. Examples. Let Jy be the group of integers, and J, = I) — ulo the group 
of integers mod yw. Using ordinary multiplication into Jq,,), it is seen that 
I, resolves I, if and only if u is a multiple of v. J» m-resolves no group G unless 
m = 0orlormG =G. The general question of when J, m-resolves I, is rather 
complicated. However, I, completely resolves I, if « > 0 and ordinary multi- 
plication mod yu is used. This is a consequence of Theorem 1 below.’ If Ri 
is the group of the real (or the rational) numbers mod 1, then Jp and R; com- 
pletely resolve each other, as is easily seen; this also follows from Theorem 1. 

R;, is infinitely divisible; no group with a finite number of generators is. 

TueorEeM 1. A discrete or compact (or locally bicompact') group completely 
resolves and is completely resolved by its R\-character group. 


2 The term annihilator has been used, but this word seems unnecessarily barbarous. 

3 If H’ resolves G, and gi ¥ g2, then (gi — g2)h # 0 for some h in H’, and gih # goh. 
Thus distinct elements of G may be shown to be distinct by multiplying by elements of H’. 

4 Thus, by multiplying by elements of H’, we can tell whether a given element of G is 
divisible by m or not. Any H’ 1-resolves G. 

5 It is often important to introduce a topology into groups and character groups; we 
shall not do this here. 

6 Direct proof: Note that 


mI, = all numbers in J, = 0 mod (m, u), 


ml, = all numbers in J, = 0 mod u/(m, x). 


Suppose ab = 0 mod uz for allbin,J,. Taking b = u/(m, uv) shows that a = 0 mod (m, x), 
and a isin ml,. 

7 See L. Pontrjagin, Annals of Math., vol. 35 (1934), pp. 361-388; also van Kampen, 
Annals of Math., vol. 36 (1935), pp. 448-463. 
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That G and H = Che,(G) resolve each other is shown by Pontrjagin, loc. cit., 
Theorem 5 and Remark 4. Now set H’ = mH. As H resolves G, (G, mH) = 
m@, by (2.1). By Pontrjagin, Theorem 4, (H, (G, mH)) = mH; hence (H, ».G@) = 
mH, and G m-resolves H. As m is arbitrary, G completely resolves H. Sim- 
ilarly H completely resolves G. 

Note that a group may resolve and be resolved by its character group, and 
not resolve it completely. For an example, take G = H = Z = [,. 


4. Extensions of homomorphisms. Suppose part of a group is mapped into 
another group. Can the map be extended through the first group so that it is 
a homomorphism? We shall give two answers to the question. 

TuroreM 2.° Let A be a free group with a finite number of generators, let A’ 
be a subset of A, and let f map the elements of A’ into the group G. The definition 
of f may be extended over A so that it is a homomorphism if and only if (a) holds: 

(a) For any elements a, , --- , a, of A’ and integers m = 0, a, --- , a, 


(4.1) > aia; in mA implies ; ai: f(a;) in mG. 


Suppose that A and H, and G and H, form group pairs, and let H resolve G 
completely. If the following condition holds, the extension of f over A is possible: 
(b) For any elements a, , --- , a, of A’, integers a, , --- , a, , and element h of H, 


(4.2) (dS aiai)h = 0 implies (SY aif(ai))h = 0. 
The necessity of (a) is clear: 


> af(ai) = f(>X aiai) = f(ma) = mf(a). 


We turn to the sufficiency. Let A* be the subgroup of A generated by the 
elements of A’. For each a = >> aja; in A*, set 


fla) = Le asf(ai). 


To prove that f is uniquely determined in A*, suppose 
,» ajay = _ Bia; , = (8; me a;)a; = 0. 


8 The theorem will be strengthened in Appendix II. It is easily seen that the conditions 
may be weakened as follows: (a) and (b) may be replaced by the corresponding pairs of 
hypotheses 

(a1) Ya,a; = 0 implies La,f(a;) = 0, 
(az) a in mA implies f(a) in mG; 
(b:) Sa;a; = 0 implies (Za;f(a;))h = 0 (all A), 
(be) ah = 0 (any fixed h, a in A’) implies f(a)h = 0. 
Further, the values m = 0, 1 may be omitted in (a). We note from the proof that (b) 
implies (a). 
The theorems in Alexandroff-Hopf, Topologie, I, pp. 591-3, are consequences of the 


strengthened theorem (using footnote 6). Our proof has naturally much in common with 
theirs. 
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(We may carry out both sums over the same terms.) Suppose (a) holds. Then 
as 0 is in OA, 


Dd (8: — adf(a) = DY Bif(ai) — DY aif(ai) 


is in OB, hence it is 0, and the last two terms are equal. Now suppose (b) holds. 
Then for any A in H, 


[> (6: - a;)a;] h = 0; 


hence 
[xX Bf(a.) — ¥ af(ad|h = 0, 


and as H resolves G, the term in brackets equals 0. 

Obviously f is a homomorphism in A*. Moreover, as each linear combina- 
tion of elements of A* is a linear combination of elements of A’, (a) or (b) is 
easily seen to hold in A* if it holds in A’. 

Let by, --- ,b-, Bea, --- , b, form a base for A, and let m,, --- , m, be posi- 
tive integers such that a; = mb,, --- , a, = m,b, form a base for A*.” Suppose 
(a) holds. As a; is in m;A, f(a;) is in m,G, and we may choose g; so that mg; = 
f(ai) @ = 1,---,r). If (b) holds, ash = bi(mh) = 0 for all h in ,.,H; hence 
f(aih = 0 for all A in ,,,H, and as H resolves G completely, f(a;) is in mG. 
Again g; exists. Set 


f(o) = 9: @=1,---,r), and f(b) =0 (@ =r+1,---,s). 


The resulting homomorphism of A into G obviously agrees with the one already 
found in A*. 

Remarks. If A’ is a subgroup of A with division (i.e., ma in A’, m # 0, 
implies a in A’), and f is a homomorphism of A’ into G, we may always extend 
f over A. For we may choose a base b,, --- , 6, , br, --- , 6, for A such that 
b,, --- , b, form a base for A’, and set f(b;) = 0(¢ > r). Further, any homo- 
morphism of any subgroup A’ of any group A into G may be extended over A, 
if G is infinitely divisible; see Pontrjagin, loc. cit., proof of Lemma 1. 


II. Linear equations with integer coefficients 
5. Cycles, etc., of a matrix. Let » be a matrix of integers: 





iM. ° Nn || 
(5.1) n = | ee ee tee | 

ll _p 

im mn || 
Given a group X, let X° = X 4+ -.-- + X, a direct sum with s terms. The 
elements of X* may be written (2, --- , 2.) or (2',---, 2°). We define the 
boundary and coboundary (with respect to n) of elements of X" and X” by 
(5.2) a(z', ---, 2") = (DS nj2’, ---, & nP 2’) in X’, 
(5.3) 6(11, +++, 2p) = (d miei, +++ 5 Dy Mri) in X". 


9 See Alexandroff-Hopf, loc. cit., p. 568, no. 24. 
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Any element (5.2) we call an X-boundary; these form a group B*(n). Define 
similarly X-coboundaries and Bx(n). 

Any element of X"[X’] whose boundary [coboundary] vanishes we shall call 
an X-cycle [X-cocycle]. We form with these the groups C*(n) and Cx(n). 

Given a group pair G, H, we form a group pair G*, H* by 


(5.4) GQ’, ---,g")-(,--+ he) = Deg’h. 
TueoreM 3. If H resolves G, or G resolves H, then 
(5.5) C°(n) = (G", Ba(n)), or Ca(n) = (H’, BY(n)). 


We shall prove the first; the other is obtained by transposing 7. For any 
G-cycle (g' , --- ,g") and H-coboundary 6(h;, --+ , hp), 


(5.6) (g', ---,.@") Ole, + hy) = Dg dD aih; = a(g', sa -,g')-(h, ~++,hp) = (). 
: ? 


Now suppose (g', --- ,g") satisfies this relation for all (hy, --- , Ap). Taking 
all the A, = 0 but the j-th gives (> nig )h = 0 (all h); as H resolves G, Yoni g' =0 
(all j), and (g', --- ,g") is a G-cycle. 


6. Linear equations in integers. We shall now prove 
Turorem 4.” Let » and G be given; if we use (8), assume H resolves G com- 
pletely. (g',---,g”) is a G-boundary, i.e., the equations 


(6.1) ni9 =g (i= 1,---,p) 
=. 


are solvable for g', --- , 9", if and only if one of the following is true: 
(a) For any integers m 2 0, a1, --- , &, . 


Pp é p ! 
(6.2) > ain} = 0 mod m (all j) implies > aig’ is in mG; 
“i i=l 
in other words, every I-cocycle mod m" (or, every In-cocycle) is orthogonal mod m 


to (g’, ere , 9’). 
(8) For any (hi, --- , hy) in H’, 


p Pp 
(6.3) > nih: = 0 (all j) implies > g' hi = 0; 
c=} =1 
in other words, every H-cocycle is orthogonal to (g' , --- , 9’). 
Example. Let G = I,, uw a prime; then » may be considered as a matrix of 
integers mod ». Using footnote 7, both (@) and (8) (with H = Z = J,) reduce 


to testing (a) for the single integer m = yu. 


1” A proof of the first half, (a), of this theorem, using a standard theorem on linear 
equations (see Veblen, Analysis Situs, Second edition, Appendix 2) has been furnished to 
me by H. T. Engstrom. 

11 z is a cocycle mod m if its coboundary is divisible by m; it is orthogonal mod m to y 
if zy is divisible by m. We might state the condition as follows: (g', --- , g™) is in the 
“complete nullifier’’ of Cy,(n). 
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The necessity of either condition is trivial; we shall prove the sufficiency. 


Set A = If = all (a,, --- , a,) (a’s integers). Let A’ be the rows 7’, --- , 9” 
of n, and set 
(6.4) f(r’) = f(m, +++ m) = 9 (¢ = 1,---, p). 


Suppose f has been extended over A so that it is a homomorphism of A into G. 
Then set 

(6.5) g =f0,0,---), 9 =f0,1,---), 

Then 


n n 1 j n , ’ 
Did = De nFO, «+41, +++ 50) = fin, +++ ym) = 9". 
* aime i 


Thus we need merely show that (a) or (b) of Theorem 2 is satisfied. 
Suppose first (a) holds, and }* ain’ is in mA. Then >> aim} = 0 mod m (all j), 
hence >> aig’ = >> aif(n') is in mG, and (a) holds. 
Suppose next (8) holds. Let A and H form a group pair by setting (a , - 


a,)h = (ah, --- ,a,h). Suppose 
} ain’ )h = > ai(nih, -++,9nhk) = 0. 
Then 
Dd ainih = DY njhi = 0 (all j), 


where h; = ajh. Hence, by (8), 
(S aif(n'))h = Hf hi = YX gihi = 0, 


and (b) holds. This completes the proof. 
Remark. (8) of this theorem and a similar theorem with 7 transposed say 
that 


(6.6) B“(n) = (G’,Cu(n)) if H resolves G completely, 
(6.7) Bu(n) = (H",C%(n)) if G resolves H completely. 


III. Applications to topology 


7. Cycles, boundaries, cocycles, and coboundaries. Let K be a (finite) com- 


plex, with cells of (¢ = 1, --- , a’) of dimension r, and incidence matrices 'd = 
\| a} ||. The boundary and coboundary of o; are 
: . 4 
r —1 e r+1.,i  r+l U 
(7.1) do; = » "dio; , io; = ; Ojo; . 
7 2 


In terms of the coefficients of chains, boundaries and coboundaries are given by 
(5.2) and (5.3) with } replaced by "@} and “*‘a}. ; 
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An r-X-chain is a linear form > 2:0; (x; in X); we may consider it as an ele- 
rar 
ment of X“. Define the groups of 


r-X-cycles: "C* = C*('8), 
r-X-boundaries: "B* = B*(""), 
r-X-cocycles: "Cx = Cx("*"8), 
r-X-coboundaries: "Bx = Bx(‘8). 


Then the r-X-homology groups and r-X-cohomology groups are the difference 
groups 


(7.2) . @ = . —_ 'B*, "Hy = Cc. — "Bx ° 

If G and H form a group pair, then r-G-chains and r-H-chains may be multi- 
plied, using (5.4). By (5.6), 
7.3) dA-B = A-6B 


for any (r + 1)-chain A and r-chain B. 
Equations (5.5), (6.6) and (6.7) give 


(7.4) 'C® = (G”, "Bu) if H resolves G, 
(7.5) "Ca = (HH, BY’) if G resolves H, 
(7.6) "BY = (G™, 'Cz) if H resolves G completely, 
(7.7) "Bs = (H*,'C°) _ if G resolves H completely. 


Expressed in words, we have (using (7.3)) 

THEOREM 5. With suitable G and H, an r-chain (with coefficients in G or H) 
is an r-G-cycle, or an r-G-boundary, or an r-H-cocycle, or an r-H-coboundary, if 
and only if it is orthogonal to every r-H-coboundary, or r-H-cocycle, or r-G-boundary, 
or r-G-cycle. 

THEOREM 6. An r-G-chain is a cycle, or a cocycle, if and only if it is orthogonal 
to every I-coboundary, or I-boundary. It is a boundary, or a coboundary, if 
and only if it is orthogonal mod m to every I n-cocycle, or I»-cycle, for all m (#1). 

The proof of the first half is like that of Theorem 3; the second half follows 
from Theorem 4, (a). 


8. Homology and cohomology groups. We shall find a case in which the 
homology groups are determined by the cohomology groups or vice versa. 

TueoreM 7. For any G and H, the groups "H°, "Ha form a pair, if G and H 
forma pair, as follows. Given elements & of "H° and ¢ of "Hu , choose a cycle C of & 
and a cocycle D of ¢, and set & = CD. 

Theorem 5 shows that the definition depends on £ and ¢ alone. 

TueoreM 8. Let Z be infinitely divisible. If H = Chz(G) and G resolves H 
completely, or if G = Chz(H) and H resolves G completely, then 


(8.1) "Hu = Chz('H’), or "H® = Chz('Ha). 
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We shall prove the first equation; the proof of the second is the same. Each 
element ¢ of "Hy determines a Z-character ¢; of "H® by the last theorem. Sup- 
pose ¢ # ¢’. Take cocycles D in ¢ and D’ in ¢’; then D’ — D is not a cobound- 
ary. By (7.7) there is a cycle C such that C(D’ — D) # 0,CD # CD’. Let 
— be the homology class of C; then & # &’, and ¢ # @¢;. Therefore "Hy 
determines in a (1-1) way a subgroup of the Z-characters of "H®. 

To show that this is the whole group, take any Z-character ¢ of "H®. For 
any cycle C with homology class £, set ¥(C) = $(£); this is a Z-character of 'C°. 
By the remark following Theorem 2, we may extend it to a Z-character y of 
G*’, the group of all r-G-chains. Set 


¥i(g) = (0, --- ,g,--- , 0) (g in the i-th place). 
As H = Ch2(G), there is an h; such that gh; = ¥.(g) (allg). Set 
D = (kh, --+ , har) = >> hii; 
then for any r-G-chain C = (g', --- ,g”), 
CD = Vgh = Dig’) = ¥',---,9") = WC). 


As y maps all boundaries into 0, D is a cocycle, by (7.5); let ¢ be its homology 
class. Then for any ¢ in "H*, & = $(). This completes the proof. 

Coro.tiary. ‘'H,, is isomorphic to the direct sum of the reduced r-th homology 
group and the (r — 1)-th torsion group, all with integer coefficients. 

This follows from the last theorem, Theorem 1, and Alexandroff-Hopf, To- 
pologie I, p. 234, equation (17’). A direct proof is not hard to give, using the 
ordinary elementary divisor theory. 

Remarks. The theorem does not hold for arbitrary Z. For, consider the 
projective plane, using G = R, , Z = Iz ; then H = 0, and hence "Hy, = 0. But 
*H° = torsion group of dimension 1 = Js (see Alexandroff-Hopf, loc. cit., p. 234), 
and hence Chz((H°) = CHz,I2 = Is. 

Given H, set G = Chp,(H); then H = Che,(G), and G completely resolves H, 
by Theorem 1. Hence "Hy = Che,("H*). As "H° is a topological invariant, 
"Hz is a topological invariant of K. 


APPENDIX I 


On closed complexes and pseudomanifolds 


For the general theory, see Alexandroff-Hopf, Topologie I, Ch. VII, $1. We 
refer to this work as AH. K will always denote a homogeneous” n-complex, 
and o:, o2,---, its n-cells. The chain o; means the n-chain }> 6;;0; with co- 
efficient 1 for ¢; and 0 for all other ¢;. Write C — Dif C is cohomologous to D. 


12 That is, each k-cell (k < n) is on some n-cell. 
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9. Closed complexes. We say K is closed if no ¢; is —0in K. That this 
definition agrees with the ordinary one is shown by Theorem 9 below. 

LeMMA. a; ts not — 0 if and only if it is contained in some cycle C mod m 
for some m # 1." 

By Theorem 6, o; is not — 0 if and only if for some m + 1 there is a cycle C 
mod m such that C-¢; # 0 mod m. Write C = ae; + --- ; then C-o; = a, 
and the above holds if and only if a # 0 (mod m), i.e., if and only if C con- 
tains o; . 

THEOREM 9. K is closed if and only if each o; is contained in a cycle mod m 
foranm # 1. 

This is a consequence of the lemma. Alexandroff-Hopf, p. 275, Satz la, shows 
that our definition is equivalent to the ordinary one. 


10. Irreducibly closed complexes. We say K is irreducibly closed if it is 
closed but no proper subcomplex is. 

TueoremM 10." If K is irreducibly closed, then 

(a) the cohomology group "Hz, is cyclic (of order # 1), and 

(b) no a; is — 0, but each o; is — some multiple of each o; . 

To prove (b) we shall show first that ¢; — 0in K — ¢;. If not, then by the 
lemma go; is contained in a cycle C mod m, m # 1, C in K — oa;. Let K’ be 
the complex containing the cells of C; then each of its cells is contained in C, 
and hence K’ is closed, by Theorem 9; but K’ is in K — o;, a contradiction. 
Say o; = 6A in K — o; ; then 


6A = 0 — pisoj;in K (some ij), 


and o; ™ pijo; , as required. 

To prove (a), take a o; , and let m be the smallest positive integer such that 
mo; — 0; if there is none, set m = 0. Each yo; is a cocycle (as dim(K) = n), 
and determines an element of "H;,. Further, given any element £ of "Hy, , 
determined by the cocycle C, 


C= > aan } a;(pji oi) = MO:i, 
7 7 


and £ is determined by uo;. Hence "H,, is cyclic, of order m. 

Remark. There exist complexes in which some p;; is +1; see AH, p. 280, 
Bemerkung. 

TuroreM 11.” For the homogeneous n-compler K to be irreducibly closed, 
either of the two following conditions is (necessary and) sufficient. 

(a) This is (b) of Theorem 10. 

(8) "H;,(K) # 0, but "H;,(K’) = 0 for any proper subcomplex K’ of K. 


13 The condition m # 1 could be omitted; for a cycle mod 1 contains no cells, i.e., is 
= 0 mod 1. 

144 Compare AH, p. 277, Satz IV and Satz V. 

1% See AH, Theorems on p. 280. 
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Suppose (a) holds; then K is closed. If K is not irreducibly closed, then 
there is a proper (homogeneous) subcomplex K’ which is closed. Take o; in 
K’, o;in K — K’, and say 

o; — ao; = 6A in K. 
Write 
A=A,+A2, A, in K’, A2in K — K’; 


we shall show that ¢; = 6A, in K’. As no of ‘ of As is on a cell of K’, 6Ae has 
no cells in K’; hence the part of 6A in K’ is the part of 6A; in K’, and this is o;. 
But then ¢; — 0 in K’, contradicting the assumption that K’ is closed. 

Suppose (8) holds. Let D be a cocycle not — 0. By Theorem 6, there is a 
cycle C mod m for some m # 1 with C-D # 0; hence C # 0. The cells con- 
tained in C form a closed complex K’, by Theorem 9. Now let K* be any 
proper subcomplex of K. Then "H,,(K*) = 0; consequently each o; of K* is 
— 0 in K*, and K* is not closed. Therefore K’ = K, and K is irreducibly 
closed. 


11. Pseudomanifolds. A pseudomanifold is a strongly connected” homo- 
geneous complex in which each (n — 1)-cell is on either one or two n-cells; it is 
closed if each (n — 1)-cell is on two n-cells. 

Tueorem 12." If the pseudomanifold K is not closed, then "Hz, vanishes; if it 
is closed, then it is an irreducibly closed complex, and "H,, is cyclic of order 0 or 2 
according as K is orientable or not. 

Take any succession of distinct n-cells oo, of, --- ,o:, 0; and o7+4, having the 
common (n — 1)-face o}~'. We may orient the cells so that 7" is on a} nega- 
tively and on o}:; positively. Let A be the sum of these o?'; then 


6A = co; — op. 


It follows that any two n-cells of K, with the proper orientations, are ~. 

If K is not closed (as a pseudomanifold), there is an (n — 1)-cell ¢"* on just 
one n-cell o”; then 60” = +o”, and o” — 0; hence each of — 0, and "Hy, 
vanishes. Suppose K is closed. Then each 6c” and hence each 6A”™ is a 
chain the sum of whose coefficients is even, and hence cannot be any o/ ; as each 
o; — + each o;, K is an irreducibly closed complex, by Theorem 11, (a). 

If K is not orientable, a cell of may be joined to itself by a succession of n- 
and (n — 1)-cells so as to reverse the orientation. Let A be the sum of the 
properly oriented (n — 1)-cells of the chain; then 6A = 200, so that 209 and 
hence any 2c; is — 0; hence "Hy, is of order 2. Otherwise, we may orient the 
cells of K concurrently; then each 6?” and hence each 6A” is a chain, the 
sum of whose coefficients vanishes, and no mo; is — 0 for m ¥ 0. It follows 
that "H,, is the infinite cyclic group. 


16 That is, each two n-cells are joined by a succession of n- and (n — 1)-cells. 
17 Compare AH, p. 281, Satz VIII. 
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APPENDIX II 


We shall show that Theorem 2 holds if A is any group with a finite number of 


generators. Extend f over A* as before. Let b’, --- ,b” form a base for A, 
' and a’, --- , a’, a base for A*. Say 
a’ = > nb’ (i = 1,---,8). 
gi 
Let yu; be the order of b’ (u; = 0 if b’ is of infinite order). Set 
ns =mwifj =i, = O0if j ¥i (i,j =1,---,7r); 
then 
> afb? = pid’ = 0 (i =1,---,r). 
j=1 
Set 
g = f(a’) (i = 1,---, 8), g'=0 (i =1---,r). 
Suppose > ain} = mk; (all j). Then 
te | 
>. aa" = > ps an; b? = > : e an} b’ =m > k;b’ 
i=l c=_ 71 c=] si g*1 
is in mA, and hence, using (a) or (b) (see footnote 8), 
1(X wc’) = p a; f(a’) = p> aig 
$1 é=} ‘= 
is in mG. Therefore, by Theorem 4, there are elements g', --- , 9” of G@ such 
that 
Lg =9' (j= 1,---,8 +47). 
& 


Setting f’(>> a:b’) = > aig’ defines uniquely a homomorphism of A into G; for 
f'(uid") ae nity = ” aos «@ (i i 1, ies r). 
7=1 
Further, 
fa’) =f(Lajb’) = Vag =¢ =f’) G=1,---,8), 
so that f’ is an extension of f. 


HarvARD UNIVERSITY. 











ON THE MAPS OF AN n-SPHERE INTO ANOTHER n-SPHERE 


By Hasster WHITNEY 


1. Introduction. It is well known that to each map' f of an n-sphere S” 
into another one So (n = 1 always) there corresponds a number d,; , the degree of 
f, and d; = d, if f and g are homotopic (see §2). H. Hopf’ has proved the con- 
verse theorem, that if d; = d, , then f and g are homotopic. The object of this 
note is to give an elementary proof of the latter theorem. The methods will be 
used and extended in later papers. 

In an appendix we give somewhat briefly a proof of the theorem for the case 
that d; = 0. This is the only case needed in the following paper; the general 
theorem then follows from that paper. The second proof is more intuitive 
geometrically than the first, but complete details would make it perhaps more 
lengthy. 


2. On deformations. A deformation of one space S in another Sp is a family 
o(p) (0 S t S< 1, pin S) of maps of S into So, continuous in both variables 
together. Given maps f and g of S into Sj, if there exists a deformation ¢; 
such that @ = f and ¢; = g, we say f and g are homotopic. If f is homotopic to 
g, Where g(p) = P, (all pin S), we say f is homotopic to zero, and f may be shrunk 
to the point Po. 

Suppose S and Sp are complexes, Ko is a simplicial subdivision of Sp, and 
f maps S into So. Then, for a sufficiently fine simplicial subdivision K of S, 
the following is true. To each vertex V of K we may choose a vertex g(V) of a 
cell of Ko which contains f(V), so that the vertices of any cell of K go into the 
vertices of a cell of Ky. This determines uniquely a “simplicial map” g of K 
into Ko , affine in each cell (see §5); moreover, f is homotopic to g. 


3. The degree of a map. Let S) be the unit n-sphere in (n + 1)-space, let 
K¢ be a simplicial triangulation of Sj, and let o> be an n-cell of Kg. We choose 
K¢ so that if P; is a point of o} and P» is the antipodal point of So, each great 
semicircle from P,; to Py intersects the boundary d0¢ of o¢ in exactly one point. 
By pushing along these semicircles, we define a deformation Q, of the identity 
Q(p) = pinto a map Q, , where Q;(p) = Py for pin Sp — ao. 

Let o* be a k-cell (k < n), in fixed correspondence with a k-simplex, and let 


Received December 1, 1936; presented to the American Mathematical Society, October, 
1936. 

1 All maps will be assumed continuous. 

2? See Alexandroff-Hopf, Topologie, I, Berlin, 1935, pp. 501-505. See also the reference 
to Lefschetz in the following paper. 
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f map o* into St. We say f is standard if f(p) = Po, or, k = n and for some 
affine map ¢ of o* = o" into «3, f(p) =2,(¢(p)). In any case, f(p) = Py in ao*. 
The map f of an n-complex K" into S¢ is standard if it is standard over each 
k-cell (k S n). 

We may orient So by orienting o¢. Let K” be a simplicial triangulation of 
the oriented n-sphere S", and let f be a standard map of K" into Sj. Let o” 
be an (oriented) n-cell of K”. If f(p) = Po in o", we set d;(o") = 0. Other- 
wise, there is a simplicial map ¢ of o” into (the whole of) o¢ such that f(p) = 
2,(¢(p)) in o”; we set d;(o") = 1 or —1 according as ¢ is positive or negative. 
We define the degree of f by 


(3.1) dy = DX 4,(o"). 


4. The theorem. In homology theory it is shown how to attach to each map 
f of S" into S¢ (both spheres oriented) an integer d; , the degree of the map. 
Moreover, if f is homotopic to g, then d; = d, , and if S" and S¢ are triangulated 
and f is standard, then d; is given by (3.1). 

Suppose f and g map S” into Sj, and d; = d,. Then for a sufficiently fine 
subdivision K" of S", both f and g can be deformed into simplicial maps and 
hence into standard maps ¢@ and y. As f and ¢, also g and y, are homotopic, 
d, = d,. By Theorem 1 below, ¢ is homotopic to y; hence f is homotopic to g. 
Therefore this theorem furnishes the converse of the statements above. 

TuHeoreEM 1. If ¢ and y are standard maps of S" into Sj, using the same sub- 
division K" of S", and dy = d, , then is homotopic to y. 

From the proof below, the following corollary is apparent. 

Corotuary. If o(V) = ¥(V) = Po for a fixed vertex V of K", we can make V 
remain at Po throughout the deformation. 

In fact, if nm = 2, all vertices of K" remain at Py. If n = 1, we may choose 
the chains of cells in §8 so that in no chain do we pass over V; then V is never 
moved. 

THEeorEM 2. For any integer y there is a map f of S" into Sj with d; = y. 

To prove this, subdivide S" into a = |y|{ n-cells. Let @ map | y | of these 
cells simplicially into o¢, positively or negatively according as y > 0 or y < 0 
(if y * 0), and set f(p) = 2,(¢(p)) in these cells and f(p) = Po elsewhere. 
Clearly d; = y. Note that the degree of the identity map of S¢ into itself is 1. 

The remainder of the paper is devoted to the proof of Theorem 1. 


5. Coérdinates p, in a cell. Any simplicial complex K” is homeomorphic 
to a complex K" in euclidean space whose cells are straight. Using K", we de- 
fine straightness in K", the center of a cell (i.e., center of mass of its vertices), 
ete. Hence an “affine map” of one cell into another has meaning. Let o be 
a cell of K", and a, the center of ¢. For each point p of the boundary de of 
let p be the point of the segment ap such that ap,/ap = t. 


* There-are (n + 1)! standard maps ¢ of o” into S} with o(p) # Po. 
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6. Certain deformations of simplexes. We prove first a combinatorial lemma, 
needed in Lemma 2. 

Lemma 1. Any even permutation of the letters aga, --- a, (n 2 2) may be made 
by means of a succession of cyclic permutations, each on three of the letters. 

This is clear if n = 2; then any even permutation is cyclic. Suppose n > 2, 
and let B = da, --+ da, be any even permutation. If a, 7 n, bring ag, to the 
right end by a cyclic permutation; bring a,,_, next to a,,. Suppose ay + 0. 
We then perform the two cyclic permutations 


a; 


Ap +++ Gay *** Bag; Gay —? Bay *** Ban, *** Ala, —> Bay *** M0 *** Bay Ban; - 


If n = 4 and ag, is not now in the second place, we perform two cyclic permuta- 
tions to bring it there, again interchanging a.,_, and a,,, ete. When a,,, ---, 
@q,-, are in their correct places, aq,_, is also; as B is even and the above permuta- 
tions are even, @@,_, and aq, are also in their correct positions. 

Lemma 2. Let o” = a--- a, be a simplex, and let aa, --- Ga, be an even 
permutation of its vertices. Then there is a deformation ¢: of o” in itself, such that 
do(p) = p, or(@s) = Ga, , o: 18 affine, and ¢; for each t is a homeomorphism both in 
o” and in its boundary. 

If n = 0 or 1, the lemma is trivial. Suppose that n = 2; say @a,@e,da, = 
ayaea,. Let ¢(a;) be the point p of aaj, (setting 2 + 1 = 0) for which 
ajp/aai., = t. Let @& map the segment a;a;,, into the broken line ¢$,(a;)aiz1¢: 
(ai,1) so that, if the line were straightened, the map would be linear. For any 
point p, (see §5) interior to o”, set ¢:(p.) = (@(p)).. As (a) = a = center 
of mass of o”, ¢; is easily seen to be affine. 

Now suppose n > 2; consider first a cyclic permutation, changing say aoa;d2 
into ayaea). Set ¢ = apaya2, o’ = az --- a,, and let [p, q, u] for pin o, gino’, 
0 < u S 1, be the point r of the segment pq for which pr/pqg = u. Define ¢, in ¢ 
as above. For any point [p, q, u] not in o, set @[p, q, u] = [¢:(p), g, ul. We 
show that ¢@ is a homeomorphism. Suppose ¢:[p, q, u] = @&[p’, q’, u’]; then 
[o(p), g, ul = [¢:(p’), 9’, w’], which implies ¢(p) = ¢:(p’), q = q’, u = w’;' as 
¢: is a homeomorphism in ¢, p = p’ also. Further, given [p, q, u] and t, we may 
find a p* for which ¢(p*) = p; then ¢[p*, q, u] = [p, q, u]. The other proper- 
ties of ¢ are clear, and the lemma for this case is proved. Now take any permu- 
tation. We may obtain it by cyclic permutations as in Lemma 2; the cor- 
responding deformations together give the required deformation. 


7. Two types of deformations of S" in Sj. Let ¢’ be a standard map of S” 
into Sj, and let o and o’ be oriented n-cells of K" with the common (n — 1)- 
face r: 


, 
o = AQ, ---a,, o' = —Qot --- Ga, T= Q--+ an. 


(a) Suppose dy-(¢) = 1, dy-(o’) = 0; we shall deform ¢’ into ¢” so that d,--(¢) 


= 0, dy-(o’) = 1, leaving K" — (¢ + o’) fixed. 
(b) Suppose dy:(¢) = 1, dg-(o’) = —1; we shall obtain dy-(¢) = dg(o’) = 0. 


‘This is so if 0 < u < 1, as we may assume. 
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In each case ¢” will be a standard map. 
, , . , ‘ 
(a) Set o; = dode --- Gn, 0, = Qod2 --- a,, orif n = 1, then o, = a, o; = ap. 
. , . . 
Let 6, and 6: be the affine maps of o; into 7 and o; determined by sending ao 
. , . . . 
into a; and ap respectively. For each p in o; , let a(p, u) run linearly along the 


segments p@,(p) and 6;(p)62(p) as u runs from 0 to 1 and from 1 to2. Set 


» , _ j¢'la(p, u — 0] (tS u), 
(7.1) dila(p, u)] = re 0)] (t > u), 
and ¢:(p) = ¢’(p) in K" — (0 + 0’). As ¢'(p) = Poin G0; + doz , this is clearly 
a deformation of ¢’ = ¢ into a map ¢” = ¢. The map ¢” in o’ is obtained 
from the map ¢’ in o by replacing a, a1, --- , a, (which form +¢) by a, ao, 

- , a, (which form +o’); hence dy(e’) = dy-(c). Also dy-(c) = 0 as ¢’(p) = 
Po in oa, and (a) is proved. 
(b) Let \ and 2’ be the affine maps of ¢ and o’ into o such that ¢/(p) = 
(A(p)) in ¢ and = Q,(A\’'(p)) ino’. Say of = bo --- bn, 
A(ai) = bk; , and \’(ao) = bin 5 (ai) = bi, (¢ > 0). 
As dy(o’) = —d,s-(—o’), and hence 


ds(c) = ds:(ao ay eee a,) = —d,:(o’) = ds(ao ay cee Gn), 


bi, --- by, is an even permutation of bj, --- bk, . Applying Lemma 2, we find 
a deformation ; of o’ in of such that \j = X’, A; is affine, and 


(7.2) Xi(ao) = A(ao), Ai(a;) = d(a,) (i > 0). 
Set 

1. _ [2% (dt(p)) p ino’, 
(7.3) o:(p) _ ‘ep. p in K" —@’. 


Then as 2,(A;(p)) = Po in do’, ¢: is a deformation of ¢’ into a map ¢* = ¢). 

For each p in +, let B(p, u) be the point q of the segment aop of « such that 
aoq/aop = u, and let B’(p, u) be the corresponding point of the segment agp in o’. 
As ) and }j are affine, (7.2) and (7.3) give 
(7.4) $*[B(p, u)] = o*[8'(p, u)] (pin 7,0 S u S 1). 
We deform ¢* into ¢” by setting 


(7.5) $7 [8(p, u)] = o7[6'(p, u)] = o*18(p, (1 — du)), 


and ¢;(p) = ¢*(p) in K" — (¢ + o’). This is clearly a deformation; (7.4) 
shows that ¢ = ¢*. As ¢”" (p) = o1(p) = Poing + o’, dg(c) = dgr(o’) = 0. 

8. Proof of Theorem 1. Suppose there are cells of K" mapped positively 
over So by ¢, and also cells mapped negatively. Then we can find a chain 


oo, --+,¢, 0f adjacent n-cells of K” such that 


ds(o) = 1, ds(o1) saa 0, rR i ds(¢,-1) = 0, ds(o,) = -1l. 
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Using (a), §7, we deform ¢ in a) + o , then in o; + a2, ete.; then, using (b), §7, 
we deform the map in o,., + o,. The new map @¢’ has d,-(e;) = 0 (¢ = 0, 

, v). Continue in this manner till no cells are mapped positively or none 
are mapped negatively over So; for definiteness, say the latter holds. Do the 
same for y. The new maps ¢* and ¥* each have exactly d, = dy cells mapped 
positively over Sj. 


Suppose d,-(o) # dy-(c) for some ¢. Then let oo, 01, --- , 0, be a chain of 
adjacent n-cells such that 
ds+(ao) = dy+(a,) = :. ds+(o,) = dy+(a0) = 0, 
dse(o;) = dy+(o;) (0<i< >»). 
Let oo , ox, , ++ , ox, be the cells of the chain for which dj- = 1. Using (a), §7, 
we deform $* over ox, + ox%,41 ete. until we have dsi(ox,) = 0, dst(e,) = 1; 
another succession of deformations makes d,3(ox,_,) = 0, dsi(ox,) = 1, ete. 


Finally dg+*(oo) = 0, dgee(ox;) = 1 (all 7), and dy+(¢,) = 1. dy-s(c) differs from 
dy+(c) over fewer cells than d,-(¢). Continuing in this manner, we deform ¢* 
into a map ¢’ with dy-(c) = d,-(c), all ¢. ’ and y* are standard. Applying 
Lemma 2, we deform ¢’ over each n-cell where necessary, to obtain y*. (Com- 
pare the first half of the proof of (b), §7.) This completes the proof. 


Appendix’ 


Let f be a map of S” into Sj with the degree 0. We first deform it into a 
simplicial map and then into a standard map ¢ (see §§ 2, 3). To shrink ¢ to 
a point is equivalent to extending ¢ through the interior R of S” (see the fol- 
lowing paper, § 4). Let o1,---,0, and o;,---,o,. be the simplexes of S” 
mapped positively and negatively over Sj respectively. Let 7; be a tube join- 
ing o; to o; inside R. We may choose these so no two intersect, and also (to 
prove the corollary) so no one cuts the radius of R to the vertex V. Let ao -- - a, 
and ap +--+ a» be positive and negative orientations of o; and o: respectively, 
such that \(a;) = b; and \’(a;) = b; determine simplicial maps of o; and o, into 
oo, Which in turn determine ¢ in o; and o;. Now carry o; through T; to o:, 
turning it so that a; goes into a; ; let g,(;) be the position of ¢; after the time ¢. 
We do this so that g:(¢;) does not intersect g:-(o;) if t # t’. (We are using a 
deformation theorem on simplexes in euclidean space, similar to but simpler than 
Lemma 2.) The definition of ¢ in Ris as follows. For p not in any g;(o;), set 
¢(p) = Po. For p ing:(o;), choose q in o; so that p = g.(q), and set (p) = $(q). 


HARVARD UNIVERSITY. 


5 Added in proof. 
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THE MAPS OF AN n-COMPLEX INTO AN n-SPHERE 
By Hasster WHITNEY 


1. Introduction. The classes of maps of an n-complex into an n-sphere 
were classified by H. Hopf’ in 1932. Recently, W. Hurewicz’ has extended 
the theorem by replacing the n-sphere by much more general spaces. Freu- 
denthal’ and Steenrod‘ have noted that the theorem and proof are simplified 
by using real numbers reduced mod 1 in place of integers as coefficients in the 
chains considered. We shall give here a statement of the theorem which seems 
the most natural; the proof is quite simple. As in the original proof by Hopf, 
we shall base it on a more general extension theorem. 

The fundamental tool of the paper is the relation of “coboundary”;’ it has 
come into prominence in the last few years. 

In later papers we shall classify the maps of a 3-complex into a 2-sphere and 
of an n-complex into projective n-space. 


I. Elementary facts 


2. Boundaries and coboundaries. Let K be a complex, with oriented cells of 
(not necessarily simplicial) of dimension r, r = 0, --- ,n. Let aj; = 1, —1,or0 
according as 0; ' is positively, negatively, or not at all, on the boundary of oj. 
An r-chain C’ is a linear form Lajoj, the a; being integers (or elements of an 
abelian group). The boundary (or contraboundary) and coboundary of C’ are 
defined by 


(2.1) a( ao) = > Qi as05; a a0%) = > Qj 85;'6;". 


As in the ordinary theory, we say C’ is a cocycle if its coboundary vanishes, 
and C’ is cohomologous to D', C’ — D’, if C’ — D’ isacoboundary. The relation 
56C” = 0 (easily proved; equivalent to d@C" = 0) says that every coboundary 


Received December 1, 1936; presented to the International Topological Conference in 
Moscow, September 1935, and to the American Mathematical Society, October 1935. 

1H. Hopf, Commentarii Mathematici Helvetici, vol. 5 (1932), pp. 39-54. See also 
Alexandroff-Hopf, Topologie I, Ch. XIII. A recent proof has been given by 8. Lefschetz, 
Fund. Math., vol. 27 (1936), pp. 94-115. In Lemma 3 he gives a new proof of the theorem 
of the preceding paper; the author does not understand how the final map is made 
simplicial. 

2 W. Hurewicz, Proc. Kén. Akad. Wet. Amsterdam, vols. 38-39 (1935-36); in particular, 
vol. 39, pp. 117-126. The full paper will appear in the Annals of Math. 

3H. Freudenthal, Compositio Math., vol. 2 (1935), footnote 8. 

4 Unpublished. 

5 This is discussed briefly in §2. For further details, see our paper On matrices of in- 
tegers, pp. 35-45 of this volume of this Journal. We refer to this paperasI. The relation 
of Theorems 2, 3 and 4 to the theorems as stated by Hopf are made apparent by the theo- 
rems in I. The present paper is independent of I. 
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isacocycle. Hence we may define the difference group of the group of r-cocycles 
P . 6 
over the group of r-boundaries, forming the r-th cohomology group. 


3. Normal maps of cells into S;. Let Sj be the (oriented) unit n-sphere in 
(n + 1)-space. Let f map the (oriented) n-cell o" into S$. We say f is normal 
if f(p) = Po , a fixed point of So, for p in the boundary do" of «”. This is equiva- 
lent to identifying the points of ao” in o”, forming an n-sphere S", and mapping 
this sphere into Sj. Hence we may define the degree’ d;(o"). If f and g are 
normal in o” and d,;(o") = d,(o"), then we may deform f into g, keeping aco” 
at Po, by II, corollary. 

Any map f of o’ into Sj, r < n, may be shrunk to Po: we deform f into a 
simplicial map, and apply ©, (see II, §3). Po» being assumed a vertex of Ko, 
if do’ is at Po it remains there during the deformation. 

If K is any complex, let K’ be the subcomplex of K containing all its cells of 
dimension S$ r. The map f of K into Sf is normal if f(p) = Po for p in K”™. 
Suppose o” or S” is subdivided into cells ¢?, and f is a normal map of it into S?. 
Then the d;(o?) are defined, and 


(3.1) do") or df(S") = : d,;(a?). 


To show this, subdivide o” or S” further, so that we can deform f into a sim- 
plicial map, and apply Q, (see II, §3). The above quantities are unchanged, 
and (3.1) is now a consequence of II, (3.1). 


4. On deformations. We shall need the following elementary results. Let 
K X I be the product of K and the unit interval J, consisting of all pairs (p, 2), 
pinK,0<t<1. The deformation ¢,(p) of K in S¢ is equivalent to the map 
&(p, t) = o(p) of K X I into Sj. Hence ¢ is homotopic to ¢, if and only if , 
defined over K X 0 + K X 1, may be extended over K X I. 

Let f map the boundary @o’ of o’ into Sj. Then f is homotopic to zero (in do’) 
if and only if it may be extended through o’. For the deformation f,(p) (p in 
ac’) into fi(p) = P is equivalent to the map f(pi_,) (see II, §5) = f:(p) of o 
into So. 

Lemmal. If ¢ = ¢ maps o” into S, and the deformation ¢, of > is defined over 
do", then its definition may be extended over o”. 

We define ¢,; in o” by 


1 
lati (0 sits 1), 
(4.1) o(Pu) = § : 
[Pees-2”) ¢ =v2“e0¢ 1), 
6 This is the character group of the homology group with numbers mod 1 as coefficient 
group. 
7 See pp. 46-50 of this volume of this Journal; we refer to this paper as IT. 
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LemMa 2. Any map ¢ of K into Sj may be deformed into a normal one; all cells 
already at Py we may keep fixed. 

We deform the map successively so that K’, K',--.,K" ‘are at Py. Sup- 
pose K”' is at Py (if 0 <r <n). As each ao’ is at Py, we may deform each 
o into Py, keeping a0" at Po (see §3). This deformation, defined over K’, is 
extended over all (r + 1)-cells, (r + 2)-cells, etc., by Lemma 1. _ It is now de- 
fined over K, and K’ is at Po. 


5. Parts of cocycles. Let K’ be a subcomplex of K. Any r-chain C of K 
may be written C’ + C’’, the coefficients of cells of K — K” [of K’] being zero 
in C’ [in C’’]. We say C’ is part of C. Clearly the chain C’ in K’ is part of a 
cocycle if and only if 6C’ cobounds in K — K’, i.e., if and only if for some chain 
C” in K — K’,6C’ = 6C”. The (r + 1)-chains are chains of K. 


6. The product K X J. We subdivide K X I (see §4) by means of all cells 
o; X I(c;in K). Orient the cells oj X 0 and o; X 1 like the o; , and orient each 
(r + 1)-cell oj X I so that of X 1 is on its boundary positively. Then 


(6.1) do; X 0) = -ej} XI+---, io; X1) =o; XI4+---, 


(6.2) (oi X I) = —Do aij"(o5" X DD. 
2 
To prove (6.2), say 6(0; X I) = Ajj'(o}*' XI) +--+. Then 
66(o; X 1) = 


6l(oi X 1) + YS aij (oj X I] 


(Ai + a;7")oj" XD +--+. =0, 

and Aj;' = —@j;'. The first equation in (6.1) is clear for r = 0; it is proved in 
succession for r = 1, 2, - - - by considering the coefficient of ¢; X J in 8(o; ' X 0). 

TuHEoreM 1. Let Cy and C, be n-chains in K = K”, and let Do and D, be the 
corresponding chains in K X Oand K X 1. Then Do + D; (asachainin K X I) 
is part of a cocycle if and only if Co — C, in K. 

Say 

C= > aio?, C, = > bio}. 


Consider any n-chain 
(6.3) D=D.+Di+ Vhioj” x 1); 
then, by (6.1) and (6.2), 
6D = —Yiao? X 1) + DY bilo? X 1) — Y hj ajo? X DY) 
> [b: — a; — > h; a7;\(o? X D). 


(6.4) 


8 K — K’ isin general not a subcomplex of K, i.e., is not closed in K. 
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Suppose Dy) + D, is part of a cocycle D; then (6.4) set = 0 gives 
a( hei) = > hydro? = » (b; _ a;)o; = C; - Co; 
| t47 i 


and Cy —C,. Conversely, suppose C; — Cy = 6(>>hjo}~"); then the last set of 
equations shows that the bracket in (6.4) vanishes, and hence D, defined by 
(6.3), is a cocycle. 


II. The theorems 


7. The extension theorem. We shall prove 

TueoreM 2. Let f be a normal map of the subcomplex K' of K = K""* into S>. 
Then f can be extended over K if and only if the chain 
(7.1) D'= DY doz )o? 


eT 
o; in K 


in K’ is part of a cocycle. 
First suppose D’ is part of a cocycle D = >> ajo? : 


(7.2) a; = d,(a;) (of in K’), p a;07;' = 0 (all j). 


f maps (K’)"” into Py ; set f(p) = Po in K"*. Let f map each of not in K’ 
into Sj with the degree a; (see II, Theorem 2); then (7.2) holds for all o?. Con- 
sider any (n + 1)-cell 0?“ of K — K’. Using (3.1), we find 


d(ao}"') = d(X ati'e?) = D aij" d/o?) 


7.3 

‘ , = z af; a; = 0. 

Hence f, considered only in 4¢}*', is homotopic to zero (II, Theorem 1), and f 
may be extended over a? ** (see §4). Thus we extend f throughout K. 

Now suppose f is extended throughout K. By Lemma 2, we deform f into 
a normal map, leaving (K’)""', and hence also K’, fixed. Call the new map f 
again, and define the a; and D by (7.2). Then D’ is part of D. By §4, f, in 
each da)", is homotopic to zero; hence (7.3) holds, and D is a cocycle. 

Remark. If f is any map of K’ into Sj, we may deform it into a normal map 
@, by Lemma 2. From Lemma 1, it is apparent that f can be extended over K 
if and only if @ can be. Define D’ by (7.1). By Theorem 2, 6D’ has zero co- 
efficients over cells of K’, and is therefore a chain, which is clearly a cocycle, of 
K” = K — K’. By Theorem 3, Remark, if f is also deformed into the normal 
map yw, defining the chain C’ of K’, then C’ — D’ in K’, and hence for some 
H in K’, 


C’ — D’ = (6H)’ = 6H — (6H)”. 


Therefore 6C’ — 6D’ = 4{(6H)’’], which lies in K”. Thus the cohomology class 
in K” of 6D’ is uniquely determined by f, and we have (using Theorem 2): f may 
be extended over K if and only if its cohomology class thus defined in K"” is ~ 0 
in K”’. 
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8. The classes of maps of K" into S;. If we put two maps of K" into S? 
into the same class if they are homotopic, the maps fall into classes, the homotopy 
classes. To any normal map f of K” into Sj we let correspond a chain C; as 
in (7.1). 

THEOREM 3. The normal maps ¢ and y of K = K”" into So are homotopic if 
and only if Cs — Cy . 

Set (p X 0) = o(p), P(p X 1) = ¥(p); then ¢ is homotopic to y if and only 
if @ may be extended through K X I (see §4). If Do and D, correspond to 
C, and Cy in K X 0 and K X 1, Theorem 2 shows that this is possible if and 
only if D’ = Do + D, is part of a cocyele in K X I. By Theorem 1, this is true 
if and only if Cs — C,. 

Remark. If K is of any dimension and ¢ and y¥ are homotopic, then C, and 
Cy, are cocycles and C,; — Cy. The first statement follows from Theorem 2; 
the second follows on considering ¢ and y in K" alone. 

TuHeoreM 4. The classes of maps of K" into Sj are in (1 — 1) correspondence 
with the elements of the n-th cohomology group of K with integer coefficients. The 
correspondence is given by deforming the map f into a normal one and taking the 
cohomology class of the resulting cocycle. In particular, f is homotopic to zero if 
and only if the corresponding cohomology class is zero. 

The deformation is possible, by Lemma 2. The cohomology class is uniquely 
determined by f, and non-homotopic maps determine different classes, by 
Theorem 3. Finally, to each cohomology class corresponds a map; we take a 
cocycle C of the class, and let f map each o” normally into Sj with the degree 
equal to its coefficient in C (see II, Theorem 2). 


9. The Theorem of Hurewicz. Let Q» be a fixed point of a space S. Then 
the classes of maps of So into S for which Py goes into Qp» form an abelian group, 
the r-th homotopy group of S.° If f maps o” [or Sj] into S, and f(p) = Qp in 
do"[f(Po) = Qo], we may call the corresponding homotopy element the degree 
d;(o")[d;(S¢)] of f. (If S = So, the n-th homotopy group is the group of integers, 
as was seen in II, so that this is a natural generalization of the term degree.) 
The fundamental formula (3.1) holds still. The theorems of the preceding 
paper become matters of definition. The proofs in the present paper hold with- 
out change, and we have a new version of the Theorem of Hurewicz: 

TueoreM 5. Theorems 2, 3 and 4 hold if we replace Sj by any locally con- 
tractible space So whose r-th homotopy groups vanish for r < n, and replace the 
integers by the n-th homotopy group of S as coefficient group in the chains and co- 
homology classes. 

Hurewicz also shows that in the above space Sp the n-th homotopy group is 
the same as the n-th homology group with integer coefficients. 


Harvarp UNIVERSITY. 


*See Hurewicz, loc. cit. We assume a knowledge of the fundamental properties of 
homotopy groups. 





ON THE EXTREME POINTS OF CONVEX SETS 
By G. BaLey Price 


Introduction. A convex set is a set such that if it contains two points, it 
contains the segment joining these points [1, p. 2, and 2. Numbers in square 
brackets refer to the bibliography at the end]. Minkowski defined certain 
points of convex sets which he called extreme points [1, pp. 15-16; 3, p. 157]. 
They are related to certain other points which are here called extreme points 
in the sense of distance to distinguish them from the former, which are called 
extreme points in the sense of Minkowski. A detailed study is made of these 
two types of extreme points of convex sets in abstract normed linear spaces. 

In the first place, it is necessary to distinguish two types of normed linear 
spaces on the basis of the convexity properties of spherical neighborhoods ($1). 
A normed linear space such that the segment joining any two points of a spherical 
neighborhood is interior to the neighborhood except at most for the given points 
themselves is called a space L*. All other normed linear spaces are classed 
together and denoted by L. The study of extreme points is far simpler in spaces 
L* than in spaces L, and the results are more complete. An example con- 
sidered in §10 shows that the property of being a space L* may depend on the 
properties of the distance function alone and not on the linearity properties 
of the space. 

In §2 the existence of extreme points is considered. An approximation 
theorem first proved by Minkowski for euclidean 3-space is extended to spaces 
L* and L in $3. Two kinds of convex sets are distinguished in §4 on the basis 
of the relation of the two kinds of extreme points, and it is shown that the set 
of extreme points in the sense of Minkowski may be either closed or not closed. 
In §$5 the closed convex hull of a given set is considered, and Minkowski’s Ap- 
proximation Theorem (§3) is extended. 

A general theorem on compact sets is established in §6. It is shown that in a 
complete metric space a set is compact if it is possible to approximate uniformly to 
it by means of closed compact sets. This theorem and Minkowski’s Approxima- 
tion Theorem (§5) enable us to show in §7 that the closed convex hull of a 
compact set in a Banach space is compact. 

The significance of Minkowski’s Approximation Theorem is considered briefly 
in §8. A series of theorems is given in §9 which establish more precisely the 
relation between a convex set and its extreme points. Some examples are 
considered in $10. 

The paper may be considered a study in the geometry of abstract space. 


1. Linear spaces and extreme points. A space which is linear and normed 
will be designated by L, its elements or points represented by z, y, --- , and 
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the distance between x and y by |\2 — y||. Any spherical neighborhood 
\||2 — xo || S ris convex, since the distance function satisfies the triangle in- 
equality. If the space L has the additional property that no point of the 
boundary || x — 20 || = r of the sphere || x — 2» || S ris an interior point of a 
segment joining two other points of the sphere, it will be designated by L* and 
called a space L with non-flat spherical boundaries. It can be shown at once 
that no point of the boundary of a sphere is an interior point of a segment join- 
ing two interior points of the sphere. 

Any euclidean space of n dimensions is a space L*. A sufficient condition 
that a space L be a space L* is that the distance function have the properties 
of the distance function in Hilbert space. 

There are spaces L which are not L*; for example, the space of continuous 
functions f(s), a < s S b, in which the distance between f(s) and g(s) is max 
| f(s) — g(s)|ona Ss 2b. For consider the functions f(s) such that | f(s) | < 
M. They form a spherical neighborhood, and two functions f(s), g(s) in it such 
that f(a) = g(a) = M are onits boundary. But all the functions 6f(s) + (1 — @) 
g(s), 0 S @ S 1, are also contained in the boundary. This space has flat spheri- 
cal boundaries, i.e., they contain segments. 

We shall now define two kinds of extreme points of convex sets. 

(1) An extreme point in the sense of Minkowski (3, p. 157] of a conver set C 
is a point xy which is not an interior point of any segment joining two points of C. 

(2) An extreme point in the sense of distance with respect to a point y of a convex 
set C is a point xp(y) of C whose distance from y is a maximum. 

As a consequence of the definitions which we have made, we have at once 
the following theorem. 

THEOREM 1.1. Ina space L* every point of the boundary of a sphere is an ex- 
treme point in the sense of Minkowski. 

Notation. In the future C will designate a set which is closed, compact, and 
convex. The set of extreme points ry of C will be designated by Ey, and the 
set of extreme points xp of C with respect to the points of a set S by Ep(S). 





2. Existence of extreme points. We proceed to establish the existence of the 
extreme points which we have defined. 

THEOREM 2.1. There exists at least one extreme point xrp(y) of any set C with 
respect to each point y. 

This theorem is obvious, for the distance from y to a point x of C is a contin- 
uous function of x, defined for z on a closed, compact set C. 

THioreM 2.2. The set of extreme points Ep(y) of C for a fixed y is compact 
and closed. 

Since Ep(y) is a subset of C, it is compact. That it is closed follows from 
the remarks made in proof of the last theorem. 

Remark. Every set C contains at least one pair of points 2, x2 such that 
Ep(z:) contains x2 and Ep(x2) contains 2,;. The segment joining such a pair 
of points is a diameter of the set. 
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THEeorEM 2.3. In a space L* any point xp of Ep(S) is also a point xm of Ey. 

The point zp is related to a point y of S in such a way that the sphere with 
center y and radius || 7» — y || has C in its interior except for certain points on 
its boundary. From the definition of a space L* it follows that xp belongs to Ey. 

This theorem together with Theorem 2.1 proves that the set Ey of a set C 
in a space L* is not empty. 


3. Minkowski’s Approximation Theorem. In this section we shall establish 
for spaces L* a theorem first proved by Minkowski [3, p. 160] for euclidean 
3-space, and we shall give its generalization in spaces L. 

The convex hull of a finite set of points 2, --- ,2, will be called a poly- 
hedron P,,. Its set of extreme points Ey is composed of the points themselves 
or a subset of them. 

Lemma 3.1. Let a set C in L be given and also the parallel set C’ at distance 
d > 0. No segment which joins a point of the boundary of C’ to an interior point 
of C’ contains a second boundary point of C’. 

The parallel set C’ at distance d from C may be defined as the set composed 
of all points whose distance from C is equal to or less than d (in this case the 
notation does not imply that C’ is compact). Let 2; be any point of the bound- 
ary of C’, and 2x2 any interior point. Then there exist points y; and ye of C 
such that || 71 — y: |! = d, || a — ye || <d. The distance from 6x; + (1 — 6)z2 
to C is not greater than its distance to 6y, + (1 — @)ye. But 


|| (Ox, + (1 — 0)x2} — {Oy + (1 — O)y2} || 
SO\|a—m || + (1 — 4) || 22 -—yl|| <d 


for 0 S 6 < 1. Thus the only point of the segment on the boundary of C’ 
is x, ,and the lemma is established. 

THEOREM 3.1. Ina space L* let any set C with extreme points Ey be given and 
anye > 0. Then there exist an N(e€) and a sequence of polyhedra P,, P2, --- with 
Pi, C P2C --- , and with all the extreme points in the sense of Minkowski of each 
one contained in Ey, such that for n = N(e) the distance from any point of C to 
P,, is less than e. 

Let D(z, P,,) denote the distance from zx to P, ; it is a continuous function of 
x, for | D(z, P,) — Dy, P,)| S || x -— y|). 

Let x; be any point of Ey, and let z, be any point of the set Ep(2,); at least 
one exists by Theorem 2.1. Consider D(x, P2) for x on C. It is a continuous 
function which is defined on a closed, compact set, and which vanishes on P2. 
Unless C is identical with P2, there is a set of points X; C C, but X; € Po, 
at which it takes on its maximum value d; > 0. The set X3 is closed and com- 
pact and lies on the boundary of S2, the parallel set to P: at the distance d;, 
and we shall show that it contains at least one point z; of Ey. For let x3 be a 
point of X; whose distance from x2 is maximum; such a one exists and || 23; — 2: || 
> 0. Now 2; is an end point of any segment of C which contains x; and a point 
of C interior to S. by Lemma 3.1; all points of C are interior to S2: except those 
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of X;, which are on its boundary. Also zs cannot be an interior point of a 
segment which contains only points of X;, since these points lie in the interior or 
on the boundary of the sphere || « — 22 || < || 23 — x2 ||. We make use of the 
fact here that spherical boundaries in a space L* are non-flat. It follows from 
the arguments given that 2; « Ey. 

We observe next that D(a, P:) 2 D(2x3, P2), for D(xs, P:) 2 D(as, Po), 
since P; C P2, and D(a2, P:) = D(as, P:), since x2 is a point of Ep(2x,). 

Suppose now that this process has been repeated until n points 2, --- , 2, 
of Ey have been obtained with D(a, P:) 2 D(a3, P2) 2 --- 2 Dan, Poa). 
Then it can be repeated to obtain a point 2,4: of Ew unless C is identical with 
P,. For since C contains points distinct from P,, D(x, P,) for x on C takes 
on its maximum value d,,; > 0 at a set of points X,., CC, but X,u. € Py. 
The set X,4; is closed and compact and lies on the boundary of the parallel 
set S, to P, at the distance d,,,. The same arguments that were used before 
will show that a point z,.,, of X,,; whose distance from rz, is a maximum belongs 
to Ey. 

Also D(a,, Pai) 2 D(tasi, Pr), for D(atnsi, Pai) 2 D(tasi, Px) since 
Pi, CP,, and D(a,, Pra) 2 D(tns1, Pn) because of the definition of r,. 

Thus either there exists a value of n such that C is identical with P,, or there 


is an infinite sequence of points 2, 22, --- of Ey with the corresponding poly- 
hedra P,, P2, --- and 
(3.1) D(x2, P1) = D(23, P2) 2---20. 


Also it is clear that D(x,, Px) 2 D(a,, Pas) for k = 1,---,n — 1 (since 
P, C P,_1), from which it follows that 


(3.2) || tn — || 2 D€rn, Pr) (k=1,---,n—I1;n = 2,3,---). 


The proof will be complete if we can show that the sequence of numbers in 
(3.1) approaches zero. But if they do not, they have a limit 6 > 0 and 


(3.3) D(x», Pra) 26 


for all values of n. Also the sequence 2, 22, --- has at least one limit point, 
since its points belong to the compact set C. Then it is possible to select a sub- 
sequence which approaches this limit. But no such sub-sequence can approach 
a limit, because (3.3) and (3.2) show that the distance from any point of it to 
all the preceding is not less than 6. Thus the assumption has led to a contra- 
diction, and the sequence of numbers in (3.1) approaches zero. The proof of the 
theorem is complete. 

An examination of the proof just given will show that the approximating 
properties of the polyhedra P,, P2, --- are in no way dependent on the fact 
that the points 2, 22, --- belong to Ey. The hypothesis that C is in L* was 
used only to establish the fact that 2, 22, --- belong to Ey. It will be seen 
at once that this proof establishes also the following theorem. 

THEOREM 3.2. Ina space L let any set C and any « > 0 be given. Then there 
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exist an N(e) and a sequence of polyhedra P,, P2,--- with P; CP, C---, 
all of which are contained in C, such that for n = N(e) the distance from any point 
of C to P,, is equal to or less than e. 

Remark 3.1. The set C is the limit of the sequence of polyhedra P;, Pe, -- - 
in two senses: (a) take all points x such that ze P,, for n = N(x) and close this 
set by adding its limit points; the set obtained is C; (b) P,, — C according to the 
definition of Blaschke (see Theorem 9.3 below and [2, p. 60]). 

Remark 3.2. By means of the approximating polyhedra, we can set up a 
denumerable set of points everywhere dense in C; hence C is separable (other 
proofs are known). An examination of Carathéodory’s proof of Blaschke’s 
Auswahlsatz shows that it is valid in any separable set; hence the validity of the 
Weierstrass-Bolzano Cluster Point Theorem for points in a closed convex set 
implies the validity of the same theorem for closed convex sets [2, pp. 62-66]. 

Remark 3.3. Let S; and S, be two closed, compact sets with convex hulls 
H,, H.. If the distance from any point of S; to S: is at most d, then the dis- 
tance from any point of H, to He is at most d, for the parallel set to H2 at the 

distance d obviously includes S;. 

Remark 3.4. The convergence to zero of the sequence of numbers in (3.1) 
is a necessary condition that the limit of the sequence of polyhedra be a set C, 
but it is easy to give an example with linear sets to show that it is not sufficient. 
We consider here a condition that is sufficient. Let a set of points x(1), x(2), --- 
be given; from it form a new sequence x(k,) = 2(1), x(ke) = x(2), and in general 
x(k,) equal to the first element of the original sequence not contained in the 
convex hull of those that precede it. Let P, be the convex hull of the first 
n points of the new sequence. A sufficient condition that the limit of the 
P,, Ps, --- be a set C, the limit being taken ag in Remark 3.1 (a) above, is 
that the points x(k), x(kz), --- belong to some set C. This condition is obvi- 
ously necessary. 


4. The two kinds of sets C. As shown by Theorem 2.3, some of the points 
which are extreme in the sense of distance are also extreme in the sense of Min- 
kowski. On the basis of this relationship we are able to distinguish two kinds 
of sets C. 

TuHeoreM 4.1. The set Ep(S) of a set C in a space L is closed if S is compact 
and closed. 

Let x be any limit point of Ep(S). Then there is a sequence 2, 22, +--+, 
x, € Ep(S), whose limit is x. We shall prove that x « Ep(S). 

By hypothesis there is a point y, « S such that a, € Eo(y.), k = 1, 2,---. 
Since S is compact and closed by hypothesis, we can select a sub-sequence from 
the y, which has a limit y, ye S. Then the limit of the sequence of the cor- 
responding 2; is 2, and we may suppose that this selection has been made before- 
hand so that y:, ye, --- approaches y. 

Suppose that x is not a point of C at maximum distance from y. Then there 
is a point z, distinct from x, which has this property, and 
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lz-yll=liz-yll+d, d>0. 
Furthermore 
llyx —2z|| 2lly—2||— lly — yll 
(4.1) . , || || 
2 ||t-—yll+d— lly. — y|l. 
But 
(4.2) lan — yall S lle —yl| + ||a— anal] + || y — yr ll. 


Since x, — 2, Yn — y, there exists an n sufficiently large, say N, so that 
\|2— zy || < d/4, |!y — yw || < d/4. 
Then from (4.1) and (4.2) we have 
llyy—z || 2|lz — y|| + 34/4, 
law — yw || S lle —y || + 4/2. 


These two inequalities contradict the assumption that zy e Ep(yw); hence, 
there exists no point z of C whose distance from y is greater than that of z. 
Then x ¢ Ep(S). The proof is complete. 

Definition 4.1. Ina space L* if there exists a closed compact set S such that the 
set Ex of a set C is identical with Ep(S), we say C isa set Cs. In all other cases, 
C is called a set Cx. 

As a result of this definition and Theorem 4.1, we have at once the following 
theorem. 

THEOREM 4.2. Ina space L* a sufficient condition that the set Ex of C be closed 
is that C be a set Cs. 

THEOREM 4.3. In spaces L* there exist both sets Cs and Cx. 

Any set with a finite number of extreme points Ey, such as a polyhedron, is a 
set Cs. In particular, every linear set C is a set Cs, the two end points being 
extreme points in both senses. The sphere || z — x || S risaset Csalso. Its 
boundary points ||z — 2 || = r form Ey (see Theorem 1.1), and S may be 
taken as the single point 2. 

We shall now give an example of a set Cx. Ona circle mark a point z, and 
let x; and zz be the end points of a segment which has z for its mid-point and is 
perpendicular to the plane of the circle. Let zs be the point on the circle 
diametrically opposite x; let 2, be one of the points on the circle which bisect 
the ares joining x; and x. In general, let z, , k = 5, 6, --- , be the point of the 
circle which bisects the are joining z,., and z. Then the sequence of points 
21, X2, --- has x as its single limit point. 

Let P,, denote the convex hull of 2, ---,2,. Then consider the set C of 
points p such that either (a) pe P,, for n = N(p), or (b) pis a limit point of the 
points (a). Then C is closed and compact, lies in a space L* (see §1), and it is 
convex also (see Remark 3.4). The points 2, 22, --- are the extreme points 
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Ey of C, but Ey is not closed, since their limit point z is not an extreme point. 
Hence, C is a set Cx, for if it were a set Cs, its extreme points Ey would form 
a closed set according to Theorem 4.2. 

The reader will readily construct plane sets C with points of Ey which are 
not extreme points in the sense of distance. Thus not every plane set C is a 
set Cs. In spite of this fact, the following theorem is easily proved. 

THEorEM 4.4. The set Ex of every plane set C is closed. 

Let pi, Po, --+ » Pre Ew CC, have the limit point p. Then the points p, and 
p belong to the boundary of C. Let 1 be the supporting line of C through p. 
Then C lies entirely on one side of l. If p is the only point of 1 in C, we see at 
once that p « Ey, and the theorem is true in this case. If the theorem is false, 
the only possibility is that p is an interior point of a segment of | which is con- 
tained in C. But in this case p would not be a limit point of points of Ey, 
contrary to hypothesis. Thus the theorem is true in all cases. 


5. On the convex hull of a set. We shall consider now the definition of the 
closed convex hull of a set S, and also two methods of constructing it. 

Definition 5.1. The closed convex hull of a set S is the set contained in all closed 
convex sets which contain S. 

The justification of this definition is the fact that the product of any number 
of closed convex sets is a closed convex set. Its weakness is that it does not 
enable us to establish directly many properties of the closed convex hull of S. 

Definition 5.2. The convex hull [4, p. 359] of S is the set of points 


12, + sali + Trtny 


where x,¢S,r; 20, andr, + --- +r, = 1. 

TuHeoreM 5.1. The closed convex hull of S is the closure of the convex hull of S. 

Still another method can be used to construct the closed convex hull of S 
in the special case that S is compact and lies in L. 

Let S be the closure of S. Then S is also compact [5, p. 89]. Let x be any 
point of L, and z; one of the points of S at maximum distance from z (it exists 
since S is compact). Let x2 be one of the points of S at maximum distance 
from x,. The convex hull of 2;, 22 is a polyhedron P:. We can continue 
as in the proof of Minkowski’s Approximation Theorem (the only difference is 
that C is replaced by 8) and show that either S is contained in some polyhedron 
P,,, or that there exists a denumerable sequence of polyhedra P; C P, C.--- 
with all the points Ew of each one contained in S. Let C be defined as the 
closure of the set of points x such that xe P, for n 2 N(x). It will be shown 
later that this set is compact; hence, the notation C is justified. 

THeoreM 5.2. Let C and P,, Po, --- be the sets whose construction we have 
just explained. Then the closed convex hull of the compact set Sin Lis C. Fur- 
thermore, given any € > 0, there exists an N(e) such that D(x, P,.) < eforn = N(e) 
and everyxeC. Finally, the set Ex of each polyhedron P,, belongs to 8. 

This theorem may be considered an extension of Minkowski’s Approximation 
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Theorem. Not only can we approximate to the convex hull of S by polyhedra, 
but we can do so by means of polyhedra whose sets Ey belong to 8S. 


6. A theorem on compact sets. Let D(x, @) denote the distance from z 
to the closed compact set G. 

THEOREM 6.1. Given a sequence G,, G2, --- of closed, compact sets in a metric 
space, and a set G for which the axiom of completeness is satisfied in addition. 
If for every « > 0 there exists an N(e) such that D(x, G,) < «for n = N(«) and 
x eG, then G is compact. 

Since the sets G, are compact, they are totally bounded [8, p. 108]. This 
fact and the hypotheses of the theorem show that G is totally bounded. Then 
since G is complete by hypothesis, it is compact [8, pp. 107-108]. 

In the next section this theorem will be applied to sets in a Banach space. 


7. The convex hull of a compact set in a Banach space. In this section we 
shall use the results of the last two sections to establish the following theorem. 

THEOREM 7.1. The convex hull of a compact set S in a Banach space is itself 
compact. 

Since S is compact, we can use the method of Theorem 5.2 for constructing 
its closed convex hull C. The polyhedra P, are closed and compact and satisfy 
all the other hypotheses for the sets G, in Theorem 6.1. The set C satisfies 
all the hypotheses for the set G in the same theorem. Thus Theorem 7.1 fol- 
lows immediately from Theorems 5.2 and 6.1. 

This theorem, although discovered independently by the author, was first 
proved by Mazur [6]. 


8. The significance of Minkowski’s Approximation Theorem. Let S be a 
compact set in a linear metric space L. Then it is well known that S is sepa- 
rable, i.e., that there exists a denumerable set of points which is everywhere 
dense in S. We are therefore led to inquire what additional information is 
given by Minkowski’s Approximation Theorem. The answer is the following 
(see Theorem 5.2): Let a compact set S in L be given. Then there exists a 
denumerable set of points 2, 22, --- , 2% € S, such that for each « > 0 there 
is an N = N(e) for which ||z — (rai + --- + rwaw) || < €, where r; 2 0, 
mr + --- + ry = 1, and zis any point in the closed convex hull of S. Further- 
more, the set of points x,, 22, -- - in general is not everywhere dense in S. 

The set of points 2,, 22, --- is of course not unique. This fact, for example, 
indicates clearly the great variety of sets of functions, linear combinations of 
which can be used to approximate to all the functions of a given set. 


9. Theorems on the set Ey. We shall now prove several theorems which 
will establish more precisely the relation between C and Ey. 
THEOREM 9.1. Let C in L* be given. Then the closed convex hull of Ey is C. 
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Let the closed convex hull of Ey be C’. Then since Ey C C, it is clear that 
C’ CC, and the proof will be complete if we can show that C CC’. 

Suppose C €C’. Then D(z, C’), which is defined and continuous for z in the 
closed, compact set C, takes on its maximum value d > 0 on a closed compact 
set X CC, X €C’. Then the set X lies on the boundary of the parallel set 
Sto C’ at the distance d. Let x be a point of X at maximum distance from some 
point y of C’. Then by the arguments used in the proof of Theorem 3.1, we 
can show that z is a point of Ey of C. But this is impossible, because Ey is 
contained entirely in C’. As a result of this contradiction we conclude that 
C CC’, and the proof is complete. 

Corotuary 9.1. A set C in L* is determined by any subset Ey of Eu whose 
closure contains Ew. 

THEOREM 9.2. Let, --- , £, be points of a set Cin L andr, --- , 7, numbers 
greoter than zero whose sum is 1, and let x = ryt, +--+ + etn. Then ze C, 
but if n > 1, x is not a point of Eu. 

Ifn > 1, then 0 <r; < 1,7 = 1, --- ,, and we can find positive numbers 
€:,---, €n such that 0 < ry; + e < 1, and such that e, = e +--+: + en. 
Then z is the mid-point of the segment joining the points (r; — «)x, + --- 
+ (Tra €n—1)Ln—-1 + (r, + En) In and (7, + €:) 21 i oo (rn + €n—1)La-1 
+ (rn — €x)%n, Which belong to C, and by definition (1) of §1, z is not an ex- 
treme point. 

The following theorem is well known [7, §9, p. 200]. 

TueoreM 9.3. Let S be an arbitrary closed bounded set in euclidean N-space 
and C its closed convex hull. If x is any point of C, then there exist points x; «8 
and numbers r;, 7; > 0,1 = 1,---,n, withry + --- +7, = 1, such that xz = 
12, + --- +7r.27,,andn=N+1. 

TueoremM 9.4. Let S be an arbitrary closed bounded set in euclidean N-space, 
and C its closed convex hull. Then Ey CSCC. 

The proof follows at once from the two preceding theorems. Let x be a point 
of C which is not a point of S. Then by Theorem 9.3, there are points x, --- , 
z, of Ssuch that « = mz, + --- + 7r,2,. Also n > 1, for otherwise x would 
be a point of S contrary to hypothesis. Then by Theorem 9.2, z is not a point 
of Ey. It follows that Ex C S, and the proof is complete. 

THEOREM 9.5. In euclidean N-space let a set C with extreme points Ey be given. 
A necessary condition that the closed convex hull C’ of Ey C Ey C C contain 
tm, tu € Ey is that xm be a point or a limit point of Ew. 

To prove this theorem, we shall suppose that zy is not a point of Ey, and 
show that it does not belong to C’. Suppose ry ¢C’. Then zy is not an ex- 
treme point in the sense of Minkowski of C’ because by Theorem 9.4 all such 
points belong to Ey. Then it follows that rw is not an extreme point in the 
sense of Minkowski of C, for C’ C C. But this contradicts the hypothesis 
that rye Eu CC. Then if zy «C’, it follows that ry «Ey, and the proof is 
complete. 

Remark 9.1. It seems likely that Theorem 9.4 is true for sets C in spaces more 
general than euclidean N-space; hence, we consider the following propositions: 





ers 











se 


" FAS 





ON EXTREME POINTS OF CONVEX SETS 65 


(A) Let S be an arbitrary closed compact set in L, and C its closed compact hull. 
Then Eu CSCC. 

(B) Let a set C in L with extreme points Ey be given. A necessary condition 
that the closed convex hull C’ of Ey, Eu © Eu CC contain xm, xu € Ew, is that 
amu be a point of the closure of Ew. 

An examination of the proof of Theorem 9.5 will show that the following 
corollary is true. 

Coro.uiary 9.2. For every set C in L, (A) implies (B). 

Throughout the remainder of this section, we shall treat sets C in L for which 
(B) holds, no assumption being made about (A). The theorems established 
will have content because of Theorem 9.5. Among other results, we shall obtain 
the following restricted converse of Corollary 9.2 (see Corollary 9.5 below): 
For every set C in L*, (B) implies (A). 

Coro.tiary 9.3. If C is in L, and if (B), a necessary condition that the closed 
convex hull of Ey C Eu CC be C is that the closure of Ey contain Ey. 

Corouiary 9.4. If C is in L*, and if (B), then the closure of the set of points 


21, %2,--- obtained in the proof of Minkowski’s Approximation Theorem 3.1 
contains Ey. 

TuHeoreM 9.6. Jf C,, Co, --- is a sequence of closed, compact, convex sets in 
L* which have the limit C in L* in the sense of Blaschke, if (B), and if ry « Ey CC, 
then there exists a sequence of extreme points tw €Ci, Ly €Ce, +++ whose limit 
is Im. 


Let v, be the limit inferior of numbers d, such that the distance from any 
point of C to C,, and from any point of C, to C, is equal to or less than d,. 
From the hypothesis that C,,— C in the sense of Blaschke, it follows that v, — 0 
asn— @, 

Suppose that the theorem is false. Then there exists an « > 0 such that 
for every N 2 1 there exists a set C, with k 2 N which has no extreme point 
in the neighborhood || x — zw || < «. 

Let S, S, be the sets of points z of C, C, such that || 7 — rw || 2 €/2, || 2 — 
rm || 2 e respectively. Let the closed convex hulls of S, 8, be H, H,. There 
exists an n, say N,, such that », < «/2forn 2 N,. Then we can assert that 
the distance from any point of S, to S does not exceed v, at least for n 2 N;. 
Furthermore, by Remark 3.3 the distance from any point of H, to H does not 
exceed v,, forn = Ny. 

From the hypothesis that proposition (B) holds, it follows that the distance 
from zy to H is positive. Call this distance d. Then there exists an n, say 

= Ne, such that v, < d/2forn = Ne. Then for n 2 N, where N is the 
larger of the integers N, and Ne, the distance from any point of H, to H is 
less than d/2, and therefore the distance from zy to H, is greater than d/2. 

Finally, from the assumption that the theorem is false as stated above it 
follows that there exists a set C,, with k 2 N, all of whose extreme points lie 
in S,;. Then H, is identical with C;, because S; contains only points of C;,, 
and it contains all of its extreme points Ey (see Theorem 9.1). Thus from the 
statement made above we see that the distance from ry to C; is greater than 
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d/2. But this contradicts the fact that »,. < d/2. Thus the assumption that 
the theorem is false has led to a contradiction, and the theorem is established. 

Remark 9.2. In the euclidean plane let C,, be the convex hull of the points 
(0, 1 + 1/n), (0, — 1 — 1/n), (1/n, 0), (— 1/n, 0). The limit set C is the line 
segment joining (0, 1) and (0, — 1), and these two points are its only extreme 
points. The point (0, 0) is also a limit point of points zy of the sets C,. Thus 
not every limit point of points ry of the sets C,, is a point ry of C. For linear 
sets, however, every limit point of extreme points is an extreme point of the 
limit set C. 

Coro.uary 9.5. Lf S is any compact set in a space L*, and if (B), then the 
set Ey of the closed convex hull C of S is contained in S. 

The proof of this corollary follows from Theorems 9.6 and 5.2, for it was shown 
in the latter that it is possible to construct a sequence of polyhedra whose sets 
Ex belong to S, and whose limit in the sense of Blaschke is C. 

From this corollary we obtain the following result, stated above. 

Coro.uary 9.6. For every set C in L*, (B) implies (A). 


10. Some examples. Consider again the functions f(s) discussed in §1 as an 
example of a space L which is not L*. The functions | f(s) | S M form a closed, 
not compact, convex set. Every function f(s) such that max | f(s) | = M is an 
extreme element in the sense of distance, but it can be shown easily that the 
only extreme elements in the sense of Minkowski are the two functions f(s) = 
+M. The closed convex hull of the set Ey is not the given closed convex set 
(see Theorem 9.1). 

Let x be the real number triple (2, x2, 23), and let || x || = max |a;|,7 = 
1, 2,3. Then the “spherical neighborhood” || z || S ris the cube with vertices 
at the points (+r, +r, +r); it is a closed, compact, convex set. The space is a 
space L but not L*. We thus have an example of a space which is an L* with 
the usual euclidean distance function and not an L* with another one. In both 
‘ases, however, the vertices of the cube form its set Ey, and their closed convex 
hull is the cube itself. Thus the conditions in Theorem 9.1 are sufficient, but 
not necessary. We see also that the extreme points in the sense of distance of 
the closed convex hull of a set S need not belong to S (see Corollary 9.5). 

So far we have characterized an extreme point zy by the original definition 
in §1 and by Theorem 9.2. One might expect to characterize zy at least in 
euclidean space also by some property of the supporting plane through the 
point, but this seems impossible. Consider, for example, a plane set C whose 
boundary contains two intersecting straight line segments and an are of a circle 
tangent to one of them. Then we see that there may be either one or many 
supporting lines through a point zy which have a single point in common with 
C, or a single one through ry, which, however, has a segment in common with C. 
If a supporting line contains only a single point of C, then that point is a point 
xu, however. 
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ABELIAN GROUPS WITHOUT ELEMENTS OF FINITE ORDER* 


By REINHOLD BAER 


An abelian group which is written so that its symbols are combined by addi- 
tion and which has no elements of finite order other than 0° may be called 
completely reducible, if it is a direct sum of groups of rank one. For every 
group is contained in a completely reducible group of the same rank. There 
exist furthermore direct irreducible groups of every finite rank and the groups 
of rank 1 are exactly the subgroups of the additive group of the rational numbers 
and therefore irreducible. 

The structure of a completely reducible group is uniquely determined by the 
ranks of the differences of certain characteristic subgroups. A survey of the 
structures of all subgroups of completely reducible groups would involve the 
solution of the general structure problem, since every group is contained in a 
completely reducible group. But it is possible to characterize a class of com- 
pletely reducible subgroups (of completely reducible groups) which are iso- 
morphic with a direct summand of the whole group. 

ivery property of a completely reducible group which refers to finite subsets 
or to subgroups of finite rank also holds true for separable groups, i.e., for groups 
whose finite subsets are contained in completely reducible direct summands. 
Countable separable groups are completely reducible. But there exist separable 
groups which are not completely reducible, e.g., vector groups like the additive 
group of all the sequences of integers. Further criteria for complete reduci- 
bility, for separability and for the complete reducibility of separable groups are 
given. There exists in particular a characterization of the direct summands of 
finite rank which holds true in every separable group and which is valid in a 
group of finite rank if, and only if, this group is completely reducible. Further- 
more, every direct summand of finite rank of a separable group is completely 
reducible. 

The subsets b’ and b” of the group J are isotype in the group J, if there 
exists a proper automorphism of J which maps b’ upon b”. The classes of iso- 
type elements of a separable group are determined by complete sets of invariants 
and an enumeration of the characteristic and of the strictly characteristic sub- 
groups is based on this classification of the elements. The existence of char- 
acteristic subgroups which are not strictly characteristic and the existence of 
elements which are not isotype though contained in the same characteristic 
and in the same strictly characteristic subgroups are closely related phenomena; 
but neither is a consequence of the other. 


Received November 3, 1936; presented to the American Mathematical Society Sep- 
tember 3, 1936. 

' The word ‘“‘group”’ is substituted for this longer statement wherever there is no danger 
of confusion. 
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A subgroup S of the group J is completely reducible in J, if some direct 
decompositions of J induce complete reductions of S. Not even all subgroups 
of rank one of a completely reducible group J are completely reducible in J. 
But it is possible to give criteria for the complete reducibility of subgroups of 
finite rank in separable groups. If the subgroup S of J is completely reducible 
in J, and if the rank of S is finite, then the type of S in J can be described by 
means of invariants which are derived from the existence of characteristic sub- 
groups of S in characteristic subgroups of J. 

All properties and concepts used are invariants and their definitions are based 
on the concept of multiplicity of an element z in the group J, i.e., the l.c.m. 
of all the positive integers n such that z = 0 mod nJ. The calculus with these 
generalized numbers (in the sense of E. Steinitz) is the basis of the methods 
applied here. 

Previous work in this domain concerns mainly direct sums of a finite number 
of infinite cyclic groups. They form a rather special, though important, class 
of completely reducible groups and their theory is embodied in the theory of 
completely reducible and separable groups as developed here. 


Chapter I. Preliminaries 


1. Dependence, rank and rational multipliers.” Let J be an abelian group 
whose elements are combined by addition and which contains no non-zero 
element of finite order. If x is an element of J and n an integer, then nz = 0 
implies that n = 0 or x = 0. An element x of J is therefore dependent on 
the subset S of J, if there exists a positive integer n such that nz is contained 
in the subgroup of J which is generated by the elements in S. A subset of J is 
dependent, if at least one of its elements depends on the others. A subset of J 
is therefore independent if, and only if, all its finite subsets are independent, 
and the finite subset b:, --- , be is independent if, and only if, }> ¢; b; = 0 

i=1 
implies that all the integral coefficients c; are 0. 

The rank of J is the smallest (finite or infinite) number r(./) such that there 
exists a subset of J on which every element of J is dependent and which con- 
tains r(J) elements. There always exist greatest independent subsets of J. 
If G is a greatest independent subset of J, then every element of J is dependent 
on G and G contains exactly r(J) elements. 

The subset C of the group J is closed (in J), if C contains every element of J 
which is dependent on C. Closed subsets are subgroups and a subgroup S of 
J is closed in J if, and only if, the class group J/S does not contain elements + 0 
of finite order. 

Since the intersection of any number of closed subgroups is also a closed 
subgroup, there always exists a smallest closed subgroup which contains a given 


2 For proofs of the facts mentioned in this section, see R. Baer, The subgroup of the 
elements of finite order of an abelian group, Ann. of Math., vol. 37 (1936), pp. 766-781. 
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set S: the closed subgroup, generated by S. It contains exactly those elements 
which are dependent on S. 

In groups of rank one any pair of elements is dependent. The elements 
x ~ 0 and y + O are dependent if, and only if, there exist integers n and m 
such that 


nz = my ~ 0. 


The ratio of n and m is uniquely determined by the dependent elements x and y; 
and to given x, n, m there exists at most one solution y of the equation nz = my. 


° ° ° ° n 
If the equation nx = my has a solution y in J, the notation y = —z may be used. 
m 


By this definition a multiplication of the elements of J with rational numbers 
has been introduced. Since it is possible that rb’ exists in J, but rb’ does not, 
the rational numbers are not operators in the usual sense. These rational 
multipliers satisfy the following rules. 

Suppose that r and s are rational numbers, x and y elements in J. 


If rx and ry exist in J, then r(x + y) If rx and sx exist in J, then (r + s)x 
exists in J and satisfies exists in J and satisfies 
r(xaty) =rrtry. (r+ s)t = rx + sz. 


If rx and s(rx) exist in J, then (sr)x exists in J and satisfies (sr)x = s(rz). 
If rx and sx exist in J, and if the denominators of r and s are relatively prime, 
then (rs)x exists in J and satisfies 


(rs)x = r(sx) = s(ra). 


A proof of the last formula may be added. If r = mn, s = hk and nandk 
are relatively prime, then there exist integers k’, n’ such that nn’ + kk’ = 1 
and therefore 


mhx = mhkk’x + mhnn'x = kn(hk'rx + mn’sz). 


It is important to note that the converses of these rules are not true. 

These formulas show that the closed subgroup of J which is generated by the 
element x # 0 is simply isomorphic with the additive group of those rational 
numbers which are multipliers of z. Groups of rank one are therefore exactly 
the subgroups of the additive group of all the rational numbers. Hence they 
may be called rational groups. 

The group J is completely reducible, if J is a direct sum of rational groups. 
If J is the direct sum of the rational groups J, and b, # 0 an element of J, , 
the elements b, form a basis of J. The subset B of J is therefore a basis of 
J if, and only if, B is a greatest independent subset of J and an equation 


k 
nr== > c;b; 


| 


where n is a positive integer, x ~ 0 an element in J, the c; are integers and the 
° ° ° ° = ° P 
b; different elements in B, implies that every (c;n_)b; exists in J. 
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J is complete, if J = nJ for every positive integer n. Complete subgroups 
are direct summands. Complete groups are direct sums of groups which are 
isomorphic with the additive group of all the rational numbers. Every greatest 
independent subset of a complete group is a basis. Every group is contained 
in an essentially uniquely determined smallest complete group. 


2. Multiplicities and derived invariants. In the following a certain general- 
ization of the multiplicative set of the positive integers will be needed.’ The 
principal concepts and properties of these generalized numbers will therefore 
be enumerated. 

If, for every prime number p, v(p) is either a non-negative integer or the 
symbol , then 

v= |] p"’ 
> 


is such a generalized number v and v is uniquely determined by its p-values v(p). 

If S is any set of (ordinary or generalized) numbers, the product of the num- 
bers in S is the number whose p-value is the sum of the p-values of the numbers 
in S. Here the sum of an infinity of positive integers is © and the sum of « 
and anything is ©. The p-value of the g.c.d. (l.c.m.) of the numbers in the 
set S is the minimum (maximum) of the p-values of the numbers in S. 

v is a divisor of w, w a multiple of v, i.e., v / w, if there exists a solution x of 
the equation w = xv. v/ wif, and only if, v(p) S w(p) for every prime number p 
(where v(p) is the p-value of v and w(p) the p-value of w). If v / w, there may 
exist many solutions of the equation w = xv. The g.c.d. w,v and the l|.c.m. 
w,v of all the solutions z of w = zv are also solutions of this equation, the small- 
est and the greatest, respectively. If v / w, the p-value of 











w,v is w,v is 

w(p) — v(p) w(p) — v(p) if w(p) is finite 
x“ ra if w(p) is infinite, v(p) finite 
oo 0 if v(p) is infinite. 


The infinite part of the number v is v,, = v,v and the finite part is vy = v,v,,. 
The symbols v,, and v; are relatively prime, the p-values of v,, either 0 or © and 
the p-values of v; are all finite. We have »v = vy,,. 

1 is the g.c.d. of all the numbers. Every number is the l.c.m. of ordinary 
integers and the l.e.m. of all numbers is the number without finite p-values. 
It will be convenient to add to these numbers a symbol « which is different 
from all numbers and a multiple of every number. 

The numbers v and w have the same genus, i.e.,|v| = | w|, if there exist 
ordinary positive integers m and n such that mv = nw. Hence | v|= | w| if, 
and only if, v,, = w,, and for almost every p the p-value of v is the same as the 
p-value of w. 


3 E. Steinitz has used this generalization for enumeration of the finite fields. 
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If | v;| = | wi! , then | ve | = | wy, | and 


i 
list.) = | wiewe|, Vigt2| =| Wig We]. 


Product, g.c.d. and l.c.m. of a finite number of genera are therefore uniquely 
determined genera. 

If the three genera a, b, ¢ satisfy a = be, then b is a divisor of a and aa multi- 
ple of b,ie.b S a. Ifb S aanda<b,thena=b. Ifa < banda +b, 
then a <b. The genus a is a divisor of the genus b if, and only if, there exist 
numbers of genus a which are divisors of certain numbers of genus b. 

DEFINITION 2.1. The multiplicity m(x) = m(x < J) of the element x in the 
group J is ©, if x = 0, and is the |.c.m. of all the positive integers n such that 
x = 0Omod nJ (i.c., such that n“'z exists in J), if x ¥ 0. 

If x # 0, the genus |x| = |x < J| of x in J is the genus of the multiplicity 
of xin J. 

If z is an element of the subgroup S of J, then m(x < S) is a divisor of 
m(x < J) and, if x # 0, then | x < S| is a divisor of |4 < J|. If Sis a closed 
subgroup of J, the elements of S have the same multiplicity and the same 
genus in S and in J. 

If Sis a (closed) subgroup of J, z an element of J, the multiplicity m(x < J/S) 
= m(S +2 < J/S) of e mod Sis a multiple of m(x < J) and, if x ¥ 0 mod S, 
the genus |x < J/S| = |S +2 < J/S| of x mod S is a multiple of its 
genus in J. 

If n is a positive integer, then m(-:nx) = nm(r) and n'‘z exists in J if, and 
only if, n is a divisor of m(z < J). If ris a positive rational number such that 
rz exists in J, then m(rz) = rm(z). 


(2.2) (a) If = >> 2;, then m(x) is a multiple of the g.c.d. of the m(x;) and, 
i=1 
if x, x; are all ¥ 0, then | x | is a multiple of the g.c.d. of the | x; | 
k 
(b) If x = > 2;, and if the elements x; ¥ 0 are contained in different com- 
i=1 


ponents of a direct decomposition of the group J, then m(x) is the g.c.d. of the m(x;) 
and | x | the g.c.d. of the | x; |. 

(c) The elements x # 0 and y # 0 generate isomorphic closed subgroups of the 
group J if, and only if, |x| = | y|. 

(d) The closed subgroup of J, generated by the element x # 0, is isomorphic 
with a subgroup of the closed subgroup of J, generated by the element y # 0, if 
and only if, |x| < |y}. 

Proof. (a) and (b) are consequences of the corresponding facts for the p- 
values of the multiplicities. If x and y are contained in the same group of 
rank one, there exist integers n and u such that nr = uy # 0 and this implies 


* The assumption that there do not exist elements # 0 of finite order in J is not needed 
for this definition. If J is an abelian group without elements of infinite order, the multi- 
plicity is essentially Priifer’s ‘‘Héhe’’. 
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the necessity of the conditions in (c) and (d). If finally the condition of 


(c) (d) 


is satisfied, there exist integers n # 0, u # 0 such that 


m(nx) = m(uy). m(nx) | m(uy). 
If therefore r # 0 is a rational number, then 


r(uy) exists if, and only if, r(nx) exists. r(uy) exists, if r(nx) exists. 


Hence in mapping r(nxz) upon r(uy) a required isomorphism is defined. 

By (c) all the elements # 0 of a rational group have the same genus. The 
genus of the elements ~ 0 of the rational group R is called the genus | R | of R. 

(c) and (d) imply that the rational groups R and R’ are isomorphic, if R is 
isomorphic with a subgroup of R’ and R’ with a subgroup of R. 

DEFINITION 2.3. If f(x) is a property, then (J, f(x)) is the (not necessarily closed) 
subgroup of J, generated by the elements x of J which satisfy f(x). 

f(x) is an additive property, if 


(2D Jef) = De, f@). 


The following properties f(x) will be used: g / m(x < .J/); the order of x mod 
the subgroup S of J is a finite divisor of g;s S |x <J\i;s<|xe<J\;|a< 
J| < s;|2 < J| €-s. Here g is an (ordinary or generalized) number and 
S a genus. 

All these properties (except the second) are additive. If f(x) is one of the 
three first properties, every element of (J, f(x)) has the property f(z). 

Since (J, g/m(x < .J/)) is the intersection of the groups nJ for finite divisors 
n of g, and since its structure depends only on g and on the structure of J, it 
may be denoted by gJ. 

The closed subgroup of J, generated by gJ, is exactly (J,|g| S |x < J |) 
and the groups (J, s S | x < J |) are therefore closed subgroups of J. 

If f(x, g, S) is the second property, and if g / m(x < J) for every z in S, then 
(J, f(x, g, S)) is the join of the subgroups nS of J for finite divisors n of g. 
Since under this assumption the structure of (J, f(x, g, S)) depends only on g 
and on the structure of S, and since every group is contained in a complete group, 
this property may be used for defining gS. If g, = 1, then g'(gJ) = J; 
if furthermore J = (J, |g| S |2!), then gg 'J) = J. Thus under these 
assumptions the structures of J and gJ and the structures of J and g ‘J deter- 
mine each other. Note that for a rational group R and a prime number p either 
p R = Ror p’R = 0. 

Finally, m(x < g J) = gm(x < J) and gm(z < gJ) = m(x < J). 

(24) (a) V,s<lz|)SWU,ssiz\), W,|2| €s) = WV,|x| £3), 


(VJ,s<|z|)sW,|4| €s) W,sslzr|)SV,|2| <8). 


lA 


5» J. is here and in the future the direct sum of the groups J, - 


v 
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(b) Every class of J(s)* = (J,s S | x|)/(J,s < | 2 |) is contained in exactly 
one class of J(s)** = (J, | 2!) « s)/(J,|2) £8). Thus a homomorphism of 
J(s)* upon the whole group J(s)** is defined and this homomorphism is an iso- 
morphism if, and only if, 

(b*) (J,s < | x |) ts exactly the intersection of (J,s S | x |) and of (J,|x| £8). 

These statements are consequences of the definitions and of the following fact. 
If y is an element of (J, |x| < s), then y is the sum of elements y; and of ele- 
ments z; such that s S | y;| ands ¢ |2;| « s. Since the y; are therefore 
contained in (J, s S |x|) and the z; in (J, |x| ¢€ s), this implies that every 
element of (J, | 2! <« s) is mod (J, 2, £ 8) congruent to an element of 
(J,s S| 2)). 

DEFINITION 2.5. The direct decomposition J = Zz: J, isa partial reduction of J, 


af for every v all the elements # 0 of J, have the same genus in J, (and in J). 
Every complete reduction of the group J is by (2.2c) also a partial reduction. 
If J = >> J, is a partial reduction of the group J/, let J(t) be the sum of all 


those J, whose elements # 0 have the genus t. Then J = >> J(t) is also a 
t 


partial reduction of J, since every element # 0 in J(t) has the genus t. Since, 
therefore, elements ~ 0 in different components J(t) have different genera, 
this decomposition is a smallest partial reduction of J. 


(2.6) If J = >> J(t) is a smallest partial reduction of J, then 
t 


(WJ,t<|r|) = + Ws), VJ,t< |r|) => As), 
tss t<s 

(J, |2| «t) = + Hs), VJ, |2| €t) = + Js) 
s<t sit 


and J(t) represents exactly the classes of J(t)* and the classes of J(t)**. 

These are consequences of (2.2) and (2.4). 
(2.7) Suppose that every finite subset of the group J is contained in a partially 
reducible direct summand of J. 

(a) (J,t <|2x]), (J,|2| « t) and (J,|x\| £ t) are closed subgroups of J. 

(b) (J, t < |x|) ds for every genus t the intersection of (J, t S |x|) and 
(J,|a2|  t). 

(c) All the elements ¥ 0 of J(t)* and of J(t)** have the genus t. 

(d) To every element b # 0 of J there exists at least one and at most a finite 
number of genera t such that 


b = Omod (J, |x| £ t), b # Omod (J, | a2! € t). 


Proof. If b is any element of J, there exists a partially reducible direct sum- 
mand D of J which contains b. If f(x) is one of the three discussed properties, 
then (D, f(x)) is a direct summand of (J, f(x)) and by (2.6) a direct summand of 
D and now (a)-—(c) are consequences of (2.6) and (2.4). If b # 0 and D = 

. 


> D(t) is a smallest partial reduction of J, then b = > b;, b; ¥ 0, b; in D(t,), 
t "Ld | 
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t; = t; fori # 7. The genera t such that b = 0 mod(J, | xz! < t), b ¥ 0 mod 
(J, |x| ¢£ t) are exactly the genera t; such that t; « t; for every j # 7, and this 
proves (d). 

THEOREM 2.8. (a) Suppose that J; is a direct sum of rational groups of genus 
t;. Then J; and J2 are isomorphic if, and only if, 


t = te, r(Ji) = r(J2). 


(b) Suppose that J; is partially reducible. Then J, and Jz are isomorphic if, 
and only if, for every genus t, J,(t)* and J2(t)* are isomorphic. 

(c) Suppose that J; is completely reducible. Then J, and Jz are isomorphic if, 
and only if, for every genus t 


r(Ji(t)*) = r(Jo(t)*). 


Proof. If J; is a direct sum of rational groups of genus t;, the number of 
summands is r(J,;) and (J;, t; < | 2|) = 0, Ji = (Ji, t; S | xz}). This proves 
(a). (b) is a consequence of (2.6) and (c) is a consequence of (a), (b), (2.6). 

CoROLLARY 2.9. Any two decompositions of a group J into rational direct 
summands are isomorphic. 

This is true since there appear exactly r(J(t)) summands of genus t in such 
a decomposition. 


Chapter II. Direct summands of finite rank 


3. Direct summands and complete reducibility of groups of finite rank. If 
the group J is the direct sum of its subgroups S and T, i.e., J = S + T, then 


(J,s S|z|) = (S,s S|z|)+ (7,8 <= |z)), 


(J,s <|2r|) = (S,s <|2])+ (7,8 < |r)). 


The subgroup of J(s)* which contains elements of the direct summand S of J 
is therefore a direct summand of J(s)*. Hence a direct summand S of the 
group J satisfies the following conditions. 

ConpiTI0n 3.1. The classes of J(s)* which contain elements of S form a closed 
subgroup of J(s)*.° 

ConpiTION 3.2. (S,s < |x|) is the intersection of S and (J,s < | 2x)). 

The above formulas imply: 
(3.3) The subgroup S of the direct summand D of the group J satisfies the 
conditions 3.1 and 3.2 in J if, and only if, S satisfies these conditions in D. 


6 It may happen that (J, s < | x |) is not a closed subgroup of J and that consequently 
J(s)* contains elements ~ 0 of finite order. Then the ‘‘closed’’ subgroups of J(s)* are 
not defined and therefore the following proposition may be substituted for condition 3.1, 
if (J, s < | x |) is not closed: a congruence nz = s mod (J, s < | x |), where nis an ordinary 
positive integer and s an element of S whose genus in J is s, has a solution z in J if, and only 
if, there exists a solution z in S. 

Note that this proposition and condition 3.1 are equivalent, if (J, s < | x |) is a closed 
subgroup of J. 
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THEOREM 3.4. The group J of finite rank is completely reducible if, and only if, 
every subgroup S of J which satisfies the conditions 3.1 and 3.2 is a direct sum- 
mand of J. 

Proof. A. Suppose that J is a completely reducible group of finite rank 
and that the subgroup S of J satisfies the conditions 3.1 and 3.2. 

Case 1. J is a direct sum of a finite number of isomorphic groups (of rank 1) 
and r(S) = 1. 

By condition 3.1 S is a closed subgroup of J. Thus if r(/) = 1, then J = S. 
Suppose now that r(/J) = 2. Then J = J’ + J” and either S is one of these 
direct summands or there exist elements b’ # 0 in J’, b” # 0 in J” and rela- 
tively prime positive integers n’, n”’ such that n’b’ + n’’b’”’ is an element # 0 
of S and m(b’) = m(b’’) = m(n'b’ + nb”). There exist therefore integers 
k’, k’”’ such that k’n’ — k’'n”” = 1. Then the elements 


¢ = n'b’ + nb”, cl’ _ k’’b’ + k’b” 


form a basis of J, since m(c’) = m(c’’) = m(b’) = m(b”), and since the subgroup 


” r ” - . » 
of J, generated by c’ andc , contains b’ andb . Thus S isa direct summand of J. 
r(J) 


If, finally, J = >> J;, where the J; are isomorphic rational groups, r(J) any 
i=1 
r(J) 
finite number, and s # 0 any element in S, then s = b 8; with s;in J;. Then 
6=) 


r(J) r(J) 
s’ = )> s; generates a closed subgroup S’ of J’ = }> J; and, by complete 
t=2 1=2 


induction, S’ is a direct summand of J’. Thus S is contained in a direct sum- 
mand of rank 2 and is therefore, as proved above, a direct summand of J. 

Case 2. J is a completely reducible group of finite rank and r(S) = 1. 

Let s be the genus of S and let S* be the subgroup of J(s)* whose classes 
contain elements of S. Then every element of S is contained in one class of S* 
and every class of S* contains by condition 3.2 exactly one element of S.  S* is 
by condition 3.1 a closed subgroup of rank one of J(s)*. Since J is completely 
reducible and of finite rank, and since therefore, by (2.6), J(s)* is a direct sum 
of a finite number of isomorphic rational groups, it follows from case 1 that S* 
is a direct summand of J(s)*, i.e., J(s)* = S* + T*. If T is the subgroup of 
(J, s S |x|) which contains (J, s < |x|) and satisfies 7* = T/(J,s < |2}), 
then (J, s S |x|) = S + T, since S represents exactly S*. Since, by (2.6), 
(J,s < |x|) is a direct summand of J, this implies that S is a direct sum- 
mand of J. 

Case 3. J is a completely reducible group of finite rank. 

There exist in S elements w # 0 such that |w < S| <« |s < S| for every 
element s # Oin S. 

For if b # 0 is an element in S whose genus in S is s, and if s < t, then (S, 
t < |x|) < (S,s S |x|) and therefore, since both these subgroups are closed 
subgroups of S, r((S,t < |x|)) < r((S,s S| 2))). 

Let now w be an element # 0 in S such that | w < S| ¢ | s < S| for every 
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s ~ Oin S, let W be the closed subgroup of S, generated by w, and s the genus 
of W. «Since (S,s < |x|) = 0, and since S satisfies the conditions 3.1 and 3.2, 
W satisfies the conditions 3.1 and 3.2 in J and is therefore as group of rank one 
a direct summand of J, i.e., J = W + J’. 

Then S = W + S’ where S’ is the intersection of S and J’, since W Ss S. 
The subgroup S’ of J’ satisfies therefore the conditions 3.1 and 3.2 in J’, and 
since r(J’) = r(J) — 1, it can be assumed that S’ is a direct summand of J’. 
Thus it has been proved by complete induction with regard to r(J) that S is a 
direct summand of J. 

B. Suppose now that J is a group of finite rank such that every subgroup of J 
which satisfies the conditions 3.1 and 3.2 in J is a direct summand of J. 

Since J is a group of finite rank, there exists (as proved in A., case 3) an ele- 
ment w ~ Oin J such that |w < J| ¢ |b < J | for every element b # 0 in J. 
The closed subgroup W of J, generated by w, satisfies the conditions 3.1 and 3.2 
and is therefore a direct summand of J, i.e., J = W + J’. Every subgroup 
S of J’ which satisfies the conditions 3.1 and 3.2 in J’ by (3.3) also satisfies them 
in J and is therefore a direct summand of J and of J’. Since r(J’) = r(J) — 1, 
it can therefore be assumed that J’ is completely reducible. Hence the com- 
plete reducibility of J has been proved by complete induction with regard 
to r(J). 

Coro.uary 3.5. If J is a completely reducible group of finite rank, every direct 
summand of J is completely reducible. 

For if the subgroup S of the direct summand D of J satisfies the conditions 
3.1 and 3.2 in D, it follows from (3.3) and Theorem 3.4 that S is a direct sum- 
mand of J and therefore a direct summand of D, i.e., D is by Theorem 3.4 com- 
pletely reducible. 

Coro.uaryY 3.6. Suppose that all the elements + 0 of the group J have the 
same genus Sin J. 

(a) If J is complete, then J is completely reducible and every closed subgroup of J 
is a direct summand of J. 

(b) If J is not complete, every closed subgroup of J is a direct summand of J if, 
and only if, J is a completely reducible group of finite rank. 

Proof. (a) is a consequence of the fact that every closed subgroup of a com- 
plete group is complete and that complete subgroups are direct summands, 
complete groups direct sums of rational groups (§1). 

Since J = (J,s S | x|),0 = (J,s < | 2|), the closed subgroups of J satisfy 
the conditions 3.1 and 3.2. Every closed subgroup of J is therefore by Theorem 
3.4 a direct summand of J, if J is a completely reducible group of finite rank. 
Suppose now that J is not complete and that every closed subgroup of J is a 
direct summand of J. If r(./) is infinite, then J contains an independent subset 
bi , be, --- ,b:,---. The closed subgroup J’ of J, generated by these elements, 
is a direct summand of J. Since every subset of the sequence of the b; generates 
a closed subgroup of J which is a direct summand of J and of J’, the elements 
b; form a basis of J’. Since J is not complete, there exists a prime number p 
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such that pJ < J. The closed subgroup of J and of J’ which is generated 
by the elements b;_,; — pb; is a direct summand J” of J and of J’. J” does not 
contain b}. But J = J’/J” satisfies pJ = J. This is a contradiction, since 
the p-value of m(b < ./) is finite for every b  Oin J. The rank of J is therefore 
finite. Since every closed subgroup of J is a direct summand of J, Theorem 3.4 
implies the complete reducibility of J. 

TueoreM 3.7. If there exists a (generalized) number g and a finite subset F 
of the group J such that g | m(f < J) for every f in F and such that J is generated 
by the elements nf with f in F and n any positive integer, dividing g, then J is a 
direct sum of a finite number of rational groups of genus | q |. 

Proof. Denote by fi, --- , fx the elements of F. Let J’ be a direct sum of 
k rational groups of genus | g| and f,, --- , f; a basis of J’ such that g = m(f; < 
J’). There exists a homomorphism a of J’ upon the whole group J such that 
fia = f;. If W’ is the subgroup of all the elements of J’ which are mapped 
upon 0 by a, then J’/W’ and J are isomorphic. W’ is therefore a closed sub- 
group of J’. W’ is by Corollary 3.6 a direct summand of J’ and J’/W’ is there- 
fore by Corollary 3.5 completely reducible. Hence J is completely reducible 
and J is by Corollary 2.9 a direct sum of rational groups of genus | g |. 

Corouiary 3.8. If there exists a finite subset F of J such that all the elements 
of F have the same genus s in J, and such that J is generated by the rational multi- 
ples of the elements in F, then J is a direct sum of a finite number of rational groups 
of genus Ss. 

For if g is the g.c.d. of the numbers m(f < J) with f in F, then m(f < J)ig = 
n(f) is an ordinary positive integer and the elements f’ = n(f)~'f for f in F form 
a subset F’ of J such that g, F’ satisfy the conditions of Theorem 3.7. 

Coro.iary 3.9.’ Suppose that all the elements + 0 of the group J of finite 
rank have the same genus in J. Then J is completely reducible if, and only 7, 
J/B is finite for every subgroup B which is generated by the rational multiples of 
the elements of a given greatest independent subset of J. 


Proof. Suppose first that J is a direct sum of a finite number of isomorphic 
r(J) 


rational groups, i.e., J = > J;, and that the subgroup B of J is generated 
i=1 


by the rational multiples of the elements of a greatest independent subset G of J. 
Then every element b # 0 of B has the same genus sin Band in J. If therefore 
J, is the intersection of B and J;, then J;/J; is a finite cyclic group. The 
direct sum J’ of the groups J; is a subgroup of B and J/J’ is a finite group, 
since r(J) is finite. Since J/B and (J/J’)/(B/J’) are isomorphic, J/B is also 
a finite group. 

Suppose now that G is a greatest independent subset of J, that B is the sub- 
group of J generated by the rational multiples of the elements in G, and that 
J/B is finite. Let F be a set of elements, containing G and a complete set of 
representatives of the classes of J//B. Then F is finite and satisfies the assump- 
tions of Corollary 3.8 and J is therefore completely reducible. 


7 A particular case of this proposition has been communicated to the author by Dr. G. 
Szekeres. 
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4. Direct summands of finite rank and separable groups. 

DEFINITION 4.1. The group J is separable, if every finite subset of J is contained 
in a completely reducible direct summand of J. 

Note that the assumptions of (2.7) are satisfied for separable groups. 

THEOREM 4.2. J is a separable group if, and only if, 

(a) every finite subset of J is contained in a direct summand of finite rank; 

(b) every subgroup of finite rank which satisfies the conditions 3.1 and 3.2 is a 
direct summand of J. 

Proof. If H is the direct sum of the rational groups H, , then every element 
of H is contained in the direct sum of a finite number of the groups H,. Every 
finite subset and every subgroup of finite rank of a separable group is therefore 
contained in a completely reducible direct summand of finite rank. Hence 
every separable group satisfies the conditions (a) and (b) by Theorem 3.4. 

If the group J satisfies the conditions (a) and (b), every direct summand of 
finite rank is by (3.3) and Theorem 3.4 completely reducible and J is therefore 
by (a) separable. 

Coro.uary 4.3. If J is a separable group, every finite subset of J is contained 
in a completely reducible direct summand of finite rank and every direct summand 
of finite rank is completely reducible.” 

Corotiary 4.4. If all the elements ¥ 0 of J have the same genus in J, then J 
is separable if, and only if, every closed subgroup of finite rank in J is a direct 
summand of J. 

This is a consequence of Theorem 4.2 and Corollary 3.6, since any n elements 
of J generate a closed subgroup of a rank S n. 

Coro.iary 4.5. Denote by C the greatest complete subgroup of the group J. 
Then every closed subgroup of finite rank of J is a direct summand of J if, and 
only if, 

(a) J/C is separable; 

(b) all the elements # 0 of J/C have the same genus in J/C. 

Proof. C is, as a complete subgroup of J, a direct summand of J, i.e., J = 
C + J’. Suppose now that every closed subgroup of finite rank of J is a direct 
summand of J. Either J’ is of rank one and conditions (a) and (b) are obvious, 
or there exist two different rational closed subgroups S and T of J’. The closed 
subgroup U of J’, generated by S and 7, is a direct summand of J and of J’, 
and S and 7 are direct summands of J and of U. By Corollary 2.9 either S 
and 7 or S and U/T are isomorphic. In the latter case lL’ = S’ + T, where 
S and S’ are isomorphic. If s # 0 is an element of S’, t # 0 an element of T, 
the closed subgroup of U, generated by s + ¢, is a direct summand of U and 
therefore |s +t<U|={|S|or=|T7|. Since the genus of s + tis the g.c.d. 
of the genera of s and of ¢, this implies that either | S| < | 7 | or|7| < | S|. 
Since S and 7 are not complete, and since the same argument applies to every 





8’ The analogous theorem for primary abelian groups may be mentioned. The primary 
abelian group P does not contain elements of infinite height if, and only if, every finite 
subset of P is contained in a finite direct summand of P. 
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element s + nt and ns + t with positive integer n, it follows from condition 3.1 
that S and 7 have the same genus, i.e., that J satisfies (a) and (b). 

Suppose now that J satisfies the conditions (a) and (b) that S is a closed 
subgroup of finite rank in J, and that S’ is its greatest complete subgroup. 
Then S’ is a direct summand of C, i.e., C = C’ + S’ and S = 8S” + S’, where 
S” is the intersection of S and J‘ + C’. The subgroup S* of J/C which con- 
tains all those classes, containing elements of S, is exactly represented by S” 
and by Corollary 4.4 is a direct summand of J* = J/C = S* + 7*. If T is the 
subgroup of J’, representing 7*, then J = T + 8S” + S’ + C’,i.e., Sis a direct 
summand of J. 

Lemma 4.6. If J is separable and S a direct summand of finite rank of J, then 
J/S is separable. 

Proof. If F* is a finite subset of J* = J/S, F a subset of J, representing F%*, 
then J has by Corollary 4.3 a completely reducible direct summand D of finite 
rank which contains S and F. Since S is a direct summand of J, S is also a 
direct summand of D, i.e., D = S + T and T is completely reducible by Corol- 
lary 3.5. The subgroup 7* of J*, represented by 7’, is a completely reducible 
direct summand of J* and contains F*, i.e., J/S is separable. 

THeoreM 4.7. Every countable separable group is completely reducible. 

Proof. Let bi, be, ---,b:,--- be an enumeration of the elements of the 
countable separable group J. It follows by complete induction from Corollary 
4.3 that there exist completely reducible direct summands J; of J such that 
r(J,) is finite, J; is a direct summand of J;, and the elements b; with 7 S 7 
are contained in J;. Then J; = S; + Ji, Si is completely reducible by 
Corollary 3.5 and J the direct sum of the completely reducible groups J; , S2, 
S;,---,i.e., J is completely reducible. 


Chapter III. Types of elements and subgroups 


5. Types of elements in separable groups. 
Derinition 5.1. The element b of the group J is a primitive element of genus s 
(in J), if 
b = 0mod (J, s S | x}), b F Omod (J, s < | 2}), 


m(b < J) = m(b < J/(J,s < |x\)) = mb < J(s)*). 


The subset F of J is primitive (in J), if F is finite, its elements are primitive 
elements in J and different elements of F have different genus in J. 

Lemma 5.2. The finite subset F of the separable group J is a primitive set in J 
if, and only if, 

(a) different elements of H have different genus in J; 

(b) the closed subgroup F of J, generated by F, is a direct summand of J and 
F is a basis of F. 

Proof. If the conditions (a) and (b) are satisfied, then every element in F 
is ~ 0 and generates a closed subgroup of J which is a direct summand of J, 
i.e., every element of F is primitive and F is therefore a primitive set. 
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If b is a primitive element in the separable group J, then the closed subgroup 
of J, generated by b, is by Theorem 4.2 a direct summand of J. If F is a primi- 
tive set in the separable group J, then F contains as a finite set an element b 
such that |f| <« |b| for every element f of F. Then J = 6 + J’, where b 
is the closed subgroup of J, generated by b, and the elements + 6 of F are all 
contained in J. Since by Lemma 4.6 also J’ is separable, and since elements 
of J’ are primitive in J’ if, and only if, they are primitive in J, (b) follows by 
complete induction from the proved facts. 

Corotiary 5.3. Suppose that the sets b, , --- , b, and bi, --- , by are primitive 
sets in the separable group J. Then there exists a (proper) automorphism a of J 
such that bia = b; for every i if, and only if, m(b; < J) = m(b; < J) for every i. 

Proof. Since J is separable, there exists a completely reducible direct sum- 
mand D of finite rank which contains all the elements b; and b;. By Lemma 5.2 


k k 
D=>6:+ B= Db + B, 
i=1 i=1 


where 6 is the closed subgroup of D, generated by b. Since b; and b; have the 
same multiplicity and therefore the same genus, and since D is a completely 
reducible group of finite rank, Theorem 2.8 and Corollary 2.9 imply that B 
and B’ are isomorphic. Since 6; and 6; are isomorphic, and since J = D + J’, 
there exists an automorphism a of J such that bia = b; for every i, ua = u 
for uin J’, Ba = B’, and this proves the corollary. 

Another consequence of Lemma 5.2 is 

Corouiary 5.4. If b is a primitive element of genus s in the separable group J, 
|b| # |c| and m(b < J)/m(c < J), then b + ¢ is also a primitive element of 
genus Ss. 

DeFIniTIon 5.5. s ts a regular or singular genus in the group J according as 
1 S r(J(s)*). 

Lemma 5.6. Suppose that b is a primitive element of genus s in the separable 
group J. 

(a) If s is singular, the primitive elements of genus s are exactly the elements 
r(b + b’), where b’' = 0 mod(J, s < |x|), m(b < J)/m(b’ < J) and risa 
rational number ~ 0 such that rb exists in J. 

(b) If s is regular, there exists a (primitive) element b’ of genus s in J such that 
b, b’ is a basis of a direct summand of J. 

Proof. If b is a primitive element of genus s in the separable group J, then 
J = 6 + J’, where 6 is the closed subgroup of J generated by b (Lemma 5.2). 
If b’ is an element of (J, s < | x |), then b’ is contained in J’, and if m(b < J) / 
m(b’ < J), then m(b + b’ < J) = m(b < J) andb = b+ bd’ mod(J,s < | 2)), 
i.e., b + b’ and every r(b + b’) ¥ 0 in J are primitive elements of genus s in J. 

If s is regular, then J’ contains an element b’’ such that b” = 0 mod(J’,s S 
|x|), b”’ # 0 mod(J’, s < |z|). Since J’ is by Lemma 4.6 also separable, 
b” is contained in a completely reducible direct summand D of finite rank for 
J’ and D has a rational direct summand of genus s, since (D,s S | x |) ¥ (D,s < 
| x |), and thus (b) is proved. 











82 REINHOLD BAER 


If finally s is singular and b” a primitive element of genus s in J, then b” = 
rb + c, where rb ¥ 0, is an element of 6 and ¢ an element of J’. Since s = 
|b’ | = | rb|, it follows that s S |c|, and since (J’,s S |x|) = (J’,s < |x|) = 
(J,s < |x!), cis an element of (J,s <|z|). Since b” = rb mod(J,s < | 2)}), 
i.e., m(b” < J) = m(rb < J) as b” is a primitive element of genus s, m(rb < J) / 
m(c < J) and therefore c’ = rc exists in J and b” = r(b + c’) has the re- 
quired form. 

We shall employ the notation 


m(b, s) = m(b < J,s) 
m(b, s+) = m(b < J, s+) 


mb < J/(J,| x4) £s)), 
mb < J/(J,|x\| €)). 


Lemma 5.7. Suppose that b, , --- , by is a primitive set in the separable group J 
k 
and that b = >> b;. 
$=} 


(a) If s ¥ | b; | for every i, then m(b, s) = m(b, s+). 
(b) If |b;| « | bi | = s for every j, then 


m(b < J,s) = &, m(b < J, s+) = m(b; < J). 
(c) If |b;| < |b: | = s for some j, then 
mb<J,s) = g.c.d. of all m(b; < J) with |b;| <8; 


m(b < J, s+) = g.c.d. of all m(b; < J) with | b;| Ss; 
m(b < J,s) # ™, m(b < J, s+) /m(b < J,s) 


and q(b, s) = m(b < J,s):m(b < J,s4-) ts an ordinary positive integer; m(b; < J) 

and m(b < J, s+) have the same finite p-value for every prime divisor p of q(b, s). 

Proof. If s is any genus, then b = 2, b, = > b; mod (J, |x| < s). 
s<jbi ss/b;| 

Since the elements b; form a primitive set in the separable group J, it follows 
from Lemma 5.2 that m(b, s) and m(b, s+) have the values given in the lemma. 
The other statements are consequences of these evaluations and of the fact that 
|u| < || implies the existence of a positive integer h such that m(u < J) / 
hm(v < J). 

Norations. If b # 0 is any element of the separable group J, sa genus such 
that m(b < J, s) # ~, q(b, s) therefore an ordinary positive integer, and g any 
(generalized) number, then 

(g, b, s)* ts the greatest divisor of g which is relatively prime to q(b, s); 


(g, b, s)** = g.c.d. of (g, b, s)* and (m(b, s), b, s)*? 
(g, b, s) = (g, b, s)*.(g, b, s)**. 





9 (g, b, s)** = [g.c.d. of (g, b, s)* and m(b, s)] = [g.c.d. of (g, b, s)* and m(b, s+)] by 
Lemma 5.7. 
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Note that by Lemma 5.7 
(m(b, s), b, s)* = (m(b, s+), b, s)*. 


Lemma 5.8. Suppose that, for every genus s, r(s) is a number of genus s. Then 
there exists corresponding to every element b # 0 of the separable group J a primitive 
set by , --- , by in J such that 


(a) bo Ea; 


(b) m(b < J, s) ¥ m(b < J, s+) if, and only if, (exactly) one of the elements 
b; is of genus s; 

(c) m(b < J, s) = ~, m(b < J, s+) = m(bi < J), if |b;| = sand m(b < 
J,t) = m(b < J, t+)(= ~) fort <s. 

(d) If sis a regular genus, | b;| = sand m(b < J,s) # ™, then 


m(b; < J) = m(b, s+)(r(s), b, s).”° 
(e) If s is a singular genus, | b; | = sand m(b < J,s) # ~, then 
m(b: < J) = m(b, s+)(r(s), b, s)hs ,” 


where h; is an ordinary positive integer, relatively prime to q(b, s) and chosen at 
random in its class H(h;, b, s) = H(bi, --- , bs. , b, S), where H(h, b, s) denotes 
the smallest class of ordinary integers which contains h; complete classes of residues 
mod q(b, s) and with n also every nn’ for n’ | r(s),, , n’ an ordinary integer. 

Proof. 1. Since J is a separable group, every element b + 0 of J is contained 
in a completely reducible direct summand D of J. There exists a smallest 
partial reduction of D and, if b(s) is the component of b in the summand, con- 
taining the elements of genus s, the elements b(s) ¥ 0 are primitive elements of 
genus s and form therefore a primitive set (in D and in J) whose sum is }, i.e., 
to every element b + 0 of J there exists a primitive set in J whose sum is b. 

2. If the elements w, b; , --- , b, form a primitive set in J whose sum is b, 
if | w | = sand m(b, s) = m(b, s+), it follows from Lemma 5.7 that m(b, s) ¥ ; 
that there exist elements b; , say bi: , --- , bs , such that | b;! < s if, and only if, 
1sjsih,0 <h sk; that 


c; = m(b;); {g.c.d. m(b;), m(w)} 
is for 1 S j S hk an ordinary positive integer; and that 1 is the g.c.d. of eq, --- , ca. 
There exist therefore integers c; such that Oe c,c; = 1 and a primitive set in J 
is defined by 7 
b; = bi + crecw (l Sis hj), 


b; = b; (h « t), 


1° m(b, st+)(r(s), b, s) = l.e.m. of m(b, s+) and (r(s), b, s)* = l.c.m. of m(b, s+) and of 
all divisors of r(s) which are relatively prime to q(b, s). 
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since |b; | < | w| and m(b;) / m(ew) = cm(w) for 1 < i S A, ice., b; and b; 
are primitive elements of the same genus and the same multiplicity in J (Corol- 
lary 5.4). Since the sum of this new primitive set is b, and since this new primi- 
tive set contains fewer elements than the old, we have the following result. 

A primitive set in J with sum b satisfies the condition (b) if, and only if, it 
contains as few elements as possible. There exist therefore to every element 
b ¥ Oin J primitive sets in J which satisfy (a) and (b). 

3. Suppose now that the primitive set bh, --- , by satisfies the conditions 
(a) and (b), that s is the genus of b; , and that m(b < J, s) # ©. Then some 
of the elements 6; satisfy | b; | < s and it can be assumed that exactly the ele- 
ments b; with 1 S j S A, 1 S A satisfy this condition. 

(m(b, s), b, s)* = (m(b, s+), b, s)* is by Lemma 5.7 the g.c.d. of the numbers 
(m(b;), b, s)* with 1 S j S hand therefore a true divisor of (m(b;), b, s)*. Then 


d; = (m(b), b, s)* ; (m(b, s), b, s)* 
is relatively prime to q(b, s). 
The numbers 
c; = m(b;); {g.c.d. m(b;), m(b,)} (lsjsh) 
are ordinary positive integers whose g.c.d. is q(b, s). 


Suppose now that w is an ordinary positive integer, dividing the finite part 
(d;), of d;. Then there exists a decomposition 


h 
w= Il Wj 
gi 


such that w; and w; are relatively prime for 7 # j’ and such that w;/ 
m(b;) ,{g.c.d. m(b;), m(b;)}, since w is relatively prime to q(b, s) and since 
m(b, s+) is the g.c.d. of the numbers g.c.d. (m(b;), m(b;)) with 1 Sj S A. 

Then c; and w; are relatively prime integers and the g.c.d. of the numbers 
cjwu,;' with 1 < j < h is exactly q(b, s). There exist therefore integers c; 
such that 


h n 
q(b, s) = >> c;e;(ww;') or g(b,s)w = Do cjcjw;' (40 mod 1). 
s™1 yi 


Since | b; | < s and w;m(b,) / c;m(b,) for 1 S j S h, a primitive set is defined by 


b; = b; (h <j # 7%), 
b) = b; + ne, c;w;'b; (l sj Sh), 
b; = (1 — nq(b, s)w™')b; (j = 4), 
where n is any given integer. This primitive set satisfies (a), (b), and 
m(b;) = m(b,) (j 0), 
m(b;) = | w — nq(b, 8) | w 'm(bi) (i = j). 





ADIT OK Se 








Ao ARO IK BSED 





ABELIAN GROUPS WITHOUT ELEMENTS OF FINITE ORDER 85 


Since it is possible to apply this construction” upon several indices 7 without 
changing the effect on the other indices, we have the following result. 

(5.8.3) Suppose that the elements b; form a primitive set, satisfying (a) and (b) 
and that for m(b, |b; |) # © n; is any integer and w; / d; = (m(b,), b, |b: |)*: 
(m(b, | b; |), b, | bs |)*. Then there exists a primitive set b; which satisfies (a) 
and (b) and 


m(b;) = m(b,), if m(b, |b; |) = @, 
= | wi — niq(d, |b; |) | wy m(d,), if m(b, |b; |) # &. 


4. Suppose that b,, --- ,b, is a primitive set, satisfying (a) and (b), and 
that the indices 7, h and the numbers w, w;, c;, c; for 1 < j < h and d; have 
the same meaning as during the proof of (5.8.3). Suppose furthermore that the 
genus s of b; is regular for the group J. Then there exists by Lemma 5.6b an 
element e in J such that b; , e is a basis of a direct summand of J and satisfies 
m(b; < J) = m(e < J). Since |b; | < s and w;m(b;) / ejm(b;) = c;m(e), and 
since é is a primitive element of genus s in J, it follows from Lemma 5.2 and 
Corollary 5.3 that a primitive set in J is defined by 


b; = b; (h<j #2), 
b; = bj + cj¢;w;'e (1 <j <h), 
b, = b; — w ‘q(b, s)e (j = i). 
This primitive set satisfies (a), (b) and 
m(b;) = m(b;) (j ¥ 2), 
m(b;) = [g.c.d. of m(b,) and w 'q(b, s)m(e)] = w 'm(b,) (j = 2), 


since m(b;) = m(e) and since w and q(b, s) are relatively prime. 

Since it is possible to apply this construction upon several indices 7 without 

changing the effect on the other indices, we have the following result. 
(5.8.4) Suppose that the elements b, , --- , b, form a primitive set which sati#- 
fies (a) and (b), and for every 7 such that m(b, | b; |) # © and | b;| isa regular 
genus for J the integer w;/d; = (m(b,), b, | b; |)* (m(b, | b: |), b, | bs |)*. Then 
there exists a primitive set b; , --- , b; which satisfies (a) and (b) and m(b;) = 
m(b;), if m(b, |b; |) = © or | b; | is a singular genus for J, or m(b;) = w;'m(b,), 
if m(b, | b; |) # © and | b; | is a regular genus. 

5. Suppose that b,, --- , b, is a primitive set which satisfies (a) and (b) 
and that m(b, |b; |) # @ if, and only if, 1. Sis2z,0 52k. Since r; = 
m(b, | b; | +)(r(| b; |), b, | bs |) has the genus | 6; | for 1 S i S z, there exists an 
ordinary positive integer n; which is relatively prime to q(b, |b; |) such that 
r; | n:m(b;) and there exist integers n;, n, such that 1 = niq(b, |b: |) + nin; . 
There exists therefore by (5.8.3) a primitive set bj, --- , b; which satisfies (a) 





1 Since m(rb,) = m(b;), if numerator and denominator of r are divisors of the infinite 
part of m(b;), it suffices for this construction to assume that w is a divisor of d; . 
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and (b) and 
m(b;) 
m(b;) 


m(b;) (z 


< 
| nen; | m(b;) (l sis 2). 


Thus it follows from Lemma 5.7 that 
(5.8.5) There exists a primitive set b; , --- , bs which satisfies (a), (b), (¢) and 
the additional condition (d*). 


(d*) If m(b, |b: |) # &, then 
m(bi) = m(b, | bi | +)(r(| Br |), B, | Bs [Ai , 


where h;; is an ordinary positive integer which is relatively prime to q(b, | b; |). 

6. Since the numbers h; in (5.8.5d*) satisfy the condition imposed on the 
numbers w; in (5.8.3), it follows that these numbers h; can be chosen at random 
in their class of residues mod q(b, | b; |); and since m(b;) = pm(b,), if pis a 
prime number which divides the infinite part r(| b; |), of m(b,), it follows 
that h; can be chosen at random in its class H(h;, b, | b; |). 

7. If m(b, |b: |) # «& and |b, | is a regular genus, then the numbers h; in 
(5.8.5d*) satisfy the condition imposed upon the numbers w; in (5.8.5), and the 
existence of a primitive set which satisfies (a) to (e) follows therefore from 
(5.8.5) and the facts proved in heading 6. 

Let s be a singular genus for the separable group J; n an ordinary positive 
integer which is relatively prime to the infinite part of the numbers of genus s; 
Y = Y(J, n, s) the subgroup of J generated by nJ and (J, |x| £ s); and 
Z = Z(J,n,s) the group of those classes of J/Y(J,,s) which contain elements 
of (J, |2| <s). 

Since J is a separable group and s a singular genus, it follows from Theorem 
2.8 that there exists a rational subgroup R of genus s and a subgroup J’ such 
that 


J=R+/',(J,\4' £8) =R+(J,|2| €s), J,ss{[r|)=R+CU,8 
< |r|), WJ’, |2| ts) = VW’, |z| £8), U’,s <|z|) = W’,s Ss |z)). 


Since every class of Z(J, n, s) contains elements of R, i.e., primitive elements 
of genus s and since every primitive element of genus s is contained in a class of 
Z, it follows that Z(J, n, s) and R/nR are isomorphic, i.e., since n is relatively 
prime to the infinite part of the numbers of genus s, Z(J, n, Ss) is a cyelic group 
of the finite order n. 

nJ, (J, |x| £ s) and (J, |a2| <« s) are characteristic subgroups of J, and 
every (proper) automorphism of J induces therefore an automorphism of 
Z(J, n, s). Two elements z and 2’ of Z are called conjugate, if there exists an 
automorphism of J which maps z upon 2’. The elements of Z are thus distributed 
into classes of conjugate elements. 

Lemma 5.9. Suppose that s is a singular genus of the separable group J and 
that the positive integer n is relatively prime to the infinite part of the numbers of 
genus Ss. 
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(a) The elements z and z' of Z(J, n, S) are conjugate if, and only if, there exist 
positive integers h,h’ which are divisors of the infinite part of the numbers of genus s 
and which satisfy hz = h’'z’. 

(b) The elements z and z' of Z(J, n, 8) are conjugate if, and only if, the primitive 
elements of genus s, representing z, have the same multiplicities as those repre- 
senting z’. 

(c) The primitive elements b and b’ of genus s represent conjugate elements of 
Z(J, n, s) if, and only if, 

(c’) b and b’ have the same order q mod Y(J, n, s); 

(e’’) there exist ordinary integers k and k’ which are relatively prime to q and 
satisfy 

k = k’ mod q, m(kb < J) = m(k’b! < J). 


Proof. If h, h’ are ordinary positive integers, dividing the infinite part of the 
numbers of genus s, and if z and 2’ are elements of Z(J, n, s) such that hz = 
h’z’, then z’ contains a primitive element b’ of genus s and the primitive element 
b = hh’ 'b’ of genus s in z satisfies m(b) = m(b’), i.e., z and 2’ are represented 
by primitive elements of the same multiplicity. 

If z and 2’ are elements of Z which are represented by primitive elements 
b and b’, respectively, such that m(b) = m(b’), there exists by Corollary 5.3 an 
automorphism of J which maps b upon b’ and therefore z upon 2’, i.e., z and 2’ 
are conjugate. 

Suppose that z and z’ are conjugate and that the primitive elements b and b’ 
of genus s represent z and z’ respectively. Then there exists an automorphism 
of J which maps 2’ upon z and therefore b’ upon the element b’’ representing z. 


Moreover, m(b’) = m(b’’). Since b, b” are primitive elements of the singular 
genus s, it follows from Lemma 5.6a that 
b” = r(b + ce) 


where m(b’’) = m(rb), rb exists in J andc = Omod (J,s < |x). Since b” =6 
mod Y(J, n, s) and rb = r(b + c) mod Y, this implies b = rb mod Y and there- 
fore b = rb mod nJ, b = rb mod nb, where b is the closed subgroup of J, gener- 
ated by b. The order q of b mod Y is nn’ where n’ is the g.c.d. of n and m(b). 
Since b and rb have the same order mod Y, n’ is also the g.c.d. of n and m(rb) 
and therefore n’"(r — 1)b = 0 mod qb, q and m(n’~'b) are relatively prime. If 
r = rr’, where r’ and r” are relatively prime integers, then r’’ is relatively 
prime to qg (as a divisor of m(n’~'b)) and r’ = r” mod q and furthermore m(r’b) = 
m(r’’b’), i.e., b and b’ satisfy the conditions (c’) and (c’’). 

Suppose now that the primitive elements b and 6’ of genus s satisfy the con- 
ditions (c’) and (c’’). Since k is relatively prime to q, there exists an integer 
k” such that kk’ = 1 mod q and since b = b’ = 0 mod nq ‘J, it follows that 
b = kk’’b = k’k"’b mod Y(J, n, 8), b’ = kk’'b! = k’k’’b' mod Y, and since m(kb) = 
m(k’b’), this implies that kb and k’b’ and therefore b and b’ represent conjugate 
elements of Z. 

If z and 2’ are conjugate elements of Z, b a primitive element of genus s, 
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representing z, then there exist relatively prime integers h, h’ such that m(b) = 
m(hh’'b) and hh’~'b represents z’.. Then h and h’ divide the infinite part of the 
numbers of genus s, and hz = h’z’. This completes the proof of the lemma. 

Noration. If b # 0 is an element of the separable group J, s a singular genus 
of J such that «© # m(b,s) # m(b, s+), then n(b, s) = m(b, s);(m(b, s), b, s)*. 

Coroutuary 5.10. (a) If b ¥ 0 is an element of the separable group J, s a 
singular genus such that ~ #~ m(b, s) # m(b, s+), then b represents a certain 
element of order q(b, s) of Z(.J, n(b, s), 8) and therefore a class C(b, s) of conjugate 
elements of Z(.J, n(b, s), $). 

(b) Suppose that, for every genus t, r(t) is a number of genus t and that the ele- 
ments b and b’ of the separable group J satisfy » # m(b,s) = m(b’,s) ~ m(b, s+) 
= m(b’,s+) for a certain singular genus s. Then n(b,s) = n(b’,s) and q(b,s) = 
q(b’, s). Moreover, C(b, s) = C(b’, s) if, and only if, H(i, --- , bh, by s) = 
H(bi, --- , by’, b’, s), where by, --- , bd, and bi, --- , by’, are primitive sets in J with 
sum b and b’ respectively which satisfy the conditions (b) to (e) of Lemma 5.8. 
H(b,, --- , be, b, S) = H(b, s) depends therefore only on b, s, and the numbers r(t), 
representing the genera t, but not on the particular representation of b by a “‘canon- 
tical” primitive set. 

Proof. If b ¥ 0 is an element of the separable group J, then there exists a 
primitive set b; , --- , bd , satisfying the conditions (a) to (e) of Lemma 5.8. If 
s is a singular genus such that » # m(b, s) # m(b, s+), then exactly one of the 


elements b; , say 6; has the genus sin J. Then b = b; mod Y(.J, n(b, s), s) and * 


now the corollary is a consequence of Lemma 5.8 and Lemma 5.9. 

Derinition 5.11. The subsets S and T of the group J are isotype (have the same 
type in J) if there exists a (proper) automorphism of J which maps S upon T. 

THEOREM 5.12. The elements b and b’ of the separable group J are isotype in J 
if, and only if,” 

(a) m(b, s) = m(b’, s) and m(b, s+) = m(b’, s+) for every genus s; 

(b) C(b, s) = C(b’, s) for every singular genus s such that ~ 4 m(b, s) = 
m(b’, s) # m(b, s+) = m(b’, s+) and therefore n(b, s) = n(b’, s), q(b, s) = 
q(b’, s). 

Proof. m(b, s), m(b, s+) and C(b, s) are invariants of the element b in the 
group J, since for their definitions only characteristic subgroups of J have been 
used. If the conditions (a) and (b) are satisfied by the elements b and 0b’, 
there exists by Lemma 5.8 and by Corollary 5.10 a primitive set b; , --- , bg with 
sum 6 and a primitive set b, , --- , bj with sum DB’ in J such that m(b; < J) = 
m(b; < J) for 1 < i < k. There exists therefore by Corollary 5.3 an auto- 
morphism a@ of J such that bia = b;, i.e., such that ba = b’. 

Coro.uary 5.13. Suppose that b; is an element of the separable group J; and 
that J; and Jz are isomorphic. Then there exists an isomorphism of J, upon Jo 
which maps b; upon be if, and only if, 

(a) m(bi < Ji, 8) = m(be < Je, 8) and m(h, < J:,8+) = m(be < J2, 8+) 
for every genus $8; 


12 A reason for the so different nature of the invariants m(b, s), m(b, s+) on the one 
hand and the invariants C(b, s) on the other will be given in Corollary 6.11. 
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(b) the primitive elements of genus s, representing C(b; , $s), have the same multi- 
plicities as those representing C(bz , s) for every singular genus s of J, and of J 
such that 


co m(b; < Ji,8s) = m(b; < Ji, 8+). 


For an isomorphism of J upon J; maps b: upon an element 6b; such that 
b, and bj satisfy in J, the conditions (a) and (b) of Theorem 5.12 if, and only if, 
the conditions (a) and (b) of the Corollary 5.13 are satisfied. This follows from 
Lemma 5.9 and Corollary 5.10. 

Coro.uary 5.14. The elements b and b’ of the separable group J satisfy the 
condition (a) of Theorem 5.12 if, and only if, 

(1) m(b, s) # m(b, s+) if, and only if, m(b’, s) # m(b’, s+); 

(2) «© # m(b, s) ¥ m(b, s+) if, and only if, < # m(b’,s) + m(b’, s+); 

(3) m(b, s+) = m(b’, s+), if m(b, s) # m(b, s+). 

Or (3) can be replaced by 

(3’) m(b, s+) = m(b’, s+), if © = m(b, s) + m(b, s+); 

(3’") g(b, s) = g(b’, s), if ~ # m(b, s) ¥ m(b, s+). 

This follows from Lemma 5.7 and Lemma 5.8, since there exists but a finite 
number of genera s such that m(b, s) # m(b, s+) and since m(b, s+) # m(b, s) 
~ o implies that m(b, s) is the g.c.d. of those m(b, t+) with t < s and m(b, t) # 
m(b, t+). 


6. Characteristic subgroups of separable groups. 

DeFINITION 6.1. The subgroup S of the group J is a characteristic subgroup of 
J, if S is mapped upon itself by every proper automorphism of J. S is regular, if 
every proper or improper automorphism of J maps S upon a subgroup of S.A 
characteristic subgroup of J which is not regular is singular. 

Regular subgroups are usually called strictly characteristic, singular ones 
characteristic, but not strictly characteristic. The above notation has been 
adopted for brevity. 

nJ and (J, 8 S ||) are examples of regular subgroups. Intersection and 
join of characteristic or regular subgroups are characteristic or regular subgroups. 

Lemma 6.2. Suppose that b ¥ 0 is an element of the characteristic subgroup S 
of the separable group J and that the genus s satisfies m(b, s) # m(b,s+). Then 
the element w of J is contained in S, if one of the following conditions is satisfied: 

1. s < | w|, m(b, s+) / m(w). 

2. s = | w| and 2m(b, s+) | m(w) 

or m(b, s+) | m(w) and S is regular 
or & is regular 
or m(b, s) # & and q(b, s) is odd. 

Two particular instances of this lemma will be needed for its proof. 
(6.2.1) Suppose that the characteristic subgroup S of the separable group J contains 
the sum b of the primitive set b, , --- ,b, in J. 

(a) 2b; is contained in S. 

(b) b; ts contained in S, if S is regular or | b; | is regular. 
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k 
Proof. By Lemma 5.2 there exists a subgroup J’ such that J = >> 6; + J’, 
fo | 


where b; is the closed subgroup of ./, generated by b;. A proper automorphism 
a of J is therefore defined by bj;a = 6; for j # 7, ua = u for u in J’, bja = 
—b; for j = 7, and an improper automorphism 8 is defined by b;8 = b; for 7 ¥ 2, 
uB = uforuin J’, 6,8 = 0 for j = 7. Then b — ba = 2b; is contained in S. 
If S is regular, then b — b8 = b; is contained in S. If finally | b; | is regular, 
there exists by Lemma 5.6b an element w in J’ such that m(w < J) = m(b; < J) 
and J’ = © + J”, where @ is the closed subgroup of J generated by w. Then 
a proper automorphism y of J is defined by by = b; for 7 ¥ 7, wy = u for win 
J’, by = bi + w for j = i, and therefore b — by = —w is contained in S. But 
since —w and b; are primitive elements of the same genus and the same multi- 
plicity in J, it follows from Corollary 5.3 that b; is contained in S. 

(6.2.2) If the characteristic subgroup S of the separable group J contains the 
primitive element b, then S contains every element c such that m(b) | m(c). 


Proof. cis by Lemma 5.8 the sum of a primitive set c , --- , c, and Lemma 
5.7 implies m(b) / m(c) / m(e,). If |c:| = |b], then there exists an ordinary 


positive integer h such that hm(b) = m(hb) = m(c;). Since hb is an element of 
S, it follows from Corollary 5.3 that c; is contained in S. If |b| < | ¢;|, then 
it follows from Corollary 5.4 that b + ¢; is a primitive element of the same genus 
and the same multiplicity as b and it follows therefore from Corollary 5.3 that 
b + c; and consequently c; are elements of S. 
(6.2.3) If the assumptions: 

bi, --- , bd, is a primitive set in the separable group J, complying with the condi- 
tions of Lemma 5.8; 

b = >> Dd; is an element of the characteristic subgroup S of J; the genus s of bi 

=l1 

is singular for J; 

co = m(b,s) * m(b, s+); 

m(b, < J) = m(b, s+)fh, where h is an ordinary positive integer which is rela- 
tively prime to q(b, s); 
are satisfied, then S contains q(b, s)h"'b, and there exists an integer h' which is 

k 


relatively prime to q(b, s) such that h"'b, + h’ + b; ts contained in S. 


Proof. By Lemma 5.8 there exists a primitive set bj, --- , b; with sum b 
in J such that 

m(b;) = m(b,) (¢ ¥ 1), 

m(b;) = m(b, s+)f(h + 9(b, s)) (i = 1). 


k 
Corollary 5.3 implies therefore that (h + q(b, s))h-'b, + z b; and consequently 


(b, s)h"b; are elements of S. Since h and q(b, s) are relatively prime, there 
q | 
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exist integers h’, q’ such that hh’ + q’q(b, s) = 1 and 


k 
h*b; + h’ Sb; = h’b + q'q(b, s)h-'b: 
i=2 


is an element of S. 

Proof of Lemma 6.2. Suppose that t is the genus of w. Then it is possible 
to choose the number r(u) of genus u in such a way that r(u) /m(w < J) for 
u <t. There exists by Lemma 5.8 and (6.2.3) a primitive set b; , --- , b. in J 
whose sum b’ is an element of S such that s is the genus of b; and m(bi < J) = 
m(b, s+), if m(b, s) = «©, m(b, < J) = m(b, s+)(r(s), b, s), if m(b, s) # ~. 

If s < tand m(b, s+) / m(w), then m(b;) / m(w) and b; + w is therefore by 
Corollary 5.4 a primitive element of genus s, satisfying m(b:) = m(b,; + w). 
Corollary 5.3 implies therefore that b’ + w and consequently w are elements of S. 

If s is regular or S is regular, then b; is by (6.2.1) an element of S. If m(b, s+) 
/ m(w), then m(b;) / m(w) and w is therefore by (6.2.2) an element of S. 

2b; is by (6.2.1) an element of S. If 2m(b, s+) / m(w), then m(2b,) / m(w) 
and w is therefore by (6.2.2) an element of S. 

Coro.uary 6.3. If the primitive set b, , --- , by in the separable group J is a 
subset of the characteristic subgroup S of J and if the element b # 0 in J satisfies 
lem. (| bi |, ---, | be |) S |b], g.e.d. (m(bi), --- , m(be)) | m(b), then b is an 
element of S. 

Proof. From the assumptions there exist relatively prime positive integers 
h; such that m(b;) | him(b) = m(h,b). Since b; is an element of S, (6.2.2) implies 
that hb is an element of S. Since the h; are relatively prime, there exist integers 
h; such that > hth; = landb = t hi h:b is an element of S. 

$i o=3 

DertniTion 6.4. If S is a subgroup of the separable group J, s a genus, then 
g(S, s) = g(S < J, s) is the g.c.d. of all the multiplicities m(b < J) of primitive 
elements b of genus s in J which are elements of S. 

Note that g(S, s) = ©, if S does not contain primitive elements of genus s. 

TuHeEorEM 6.5. (a) If S is a characteristic subgroup of the separable group J, 
then S contains every primitive element b of genus s in J such that g(S, s) 
/m(b < J). 

(b) If S is a regular subgroup of the separable group J and T a characteristic 
subgroup of J, then S S T if (and only if) g(T, s) | g(S, s) for every genus s. 

(ec) If S and T are regular subgroups of the separable group J, then S = T if 
(and only if) g(T, s) = g(S, s) for every genus s. 

Proof. m(b < J) has the same infinite part for every primitive element b 
of genus sin J. If the prime number p does not divide the infinite part of the 
numbers of genus s, and if g(S, s) # ©, there exists a primitive element b, 
of genus s in J which is an element of S such that the p-values of m(b, < J) 
and g(S, s) are equal (and finite). Let now b be a primitive element of genus s 
in J such that g(S, s) / m(b < J). S contains a primitive element b’ of genus s. 
If m(b’ < J) / m(b < J), then m(b < J) = m(hb’ < J) for some integer h ¥ 0, 
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and b is an element of S by Corollary 5.3. If m(b’ < J) is not a divisor of 
m(b < J), there exists a positive integer k which is relatively prime to the in- 
finite part of m(b) such that m(b’ < J)/m(kb < J) = km(b < J). kb is 
therefore an element of S. Let h be the smallest positive integer such that Ab 
isan element of S. Thenh/k. Furthermore there exists in S by Corollary 5.3 
a multiple of b whose multiplicity has the same p-value as m(b, < J). If 
k ~ land p/ k, there exists in S an element wp ‘b, where w is a positive integer 
relatively prime to p. If w’w + k’k = g.c.d. of wand k, then (w’w + k’k)b is an 
element of S and 0 < w’'w + k’k <k. That being impossible, k = 1, and b is an 
element of S. 

If S is regular and the sum b of the primitive set b; , --- , b, an element of S, 
then by (6.2.1) every b; is contained in S. If the characteristic subgroup T of 
J satisfies g(7, s) / g(S, s), it follows from (a) that S < T, i.e., (b) is true. (ec) 
is a consequence of (b). 

THEOREM 6.6. Let, for every genus s, g(s) be either a (generalized) number or the 
symbol ~. Then there exists a regular subgroup S of the separable group J such 
that gS < J,s) = g(s) for every genus s if, and only if, 

(a) g(s) = ~, if VJ,s< |r|) = V,s Ss |r)}); 

(b) of g(s) # @, then 

(b1) | g(s)| Ss; 
(b2) the infinite part of g(s) is the infinite part of the numbers of genus s; 
(b3) the finite part g(s); / g(t) for every t < s; 

(c) of g(t) = ~,t < sand (J,s <|2|) ¥ (J,s S| 2)), theng(s) # ~. 

Proof. A. Suppose that S is a regular subgroup of the separable group J. 
If (J,s < |x|) = (J,s S | x}), there do not exist primitive elements of genus s 
in J, i.e., g(S, 8) satisfies (a). That g(S, s) satisfies (b1) and (b2) is obvious. 
(b3) and (c) are consequences of Corollary 6.3. 

B. Suppose that the numbers g(s) satisfy the conditions (a) to (c). Then put 


S(s) = VJ; s S |x|, g(s)/m(@ < J)). 


Thus for every genus s a regular subgroup of J has been defined. S(s) = 0 if, 
and only if, g(s) = © (by (c)). S(s) consists exactly of the elements x # 0 with 
s <= |2|,g(s)/m(x). (a), (bl), (b2) and (ce) imply that g(S(s) < J, s) = g(s). 

Now let S be the subgroup of J, generated by all these subgroups S(s). As 
the join of regular subgroups, S is a regular subgroup of J. Since by Lemma 5.8 
every element of S(s) is the sum of primitive elements b such that s < | b|, 
g(s) / m(b), it follows that every primitive element s of genus t which is con- 
tained in S has the form s = u + v + w, where 


“u= ® u;, Us primitive, | u;| < t, g(| ui |) / m(us); 
v = >. vj, »; primitive of genus t, g(t) / m(v,); 


w= Y¥ w;, wW; primitive, | w;| £ t, g(| ws!) / m(wi). 
t 
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Then s = u + vmod (J, |x| € t), and since s is a primitive element of genus t 
in the separable group J, it follows from Lemma 5.2 that m(u + v) / m(s). 

g(t) / m(v;) implies g(t) / m(v). 

g(t), = m/(s), , since t is the genus of s and g(s) satisfies (b2). 

g(t); / g(| us|) | m(u,), since | u; | < t and g(s) satisfies (¢) and (b3). There- 
fore g(t); /m(u). Thus it has been proved that g(t) /m(u + v)/ m(s) and 
therefore that g(t)/g(S < J, t). Since S(t) < S, it follows that g(S, t) / 
g(S(t), t) = g(t), i.e., g(s) = g(S,s). 

Coro.iary 6.7. Suppose that b ¥ 0 is an element of the separable group J 
and that S is the smallest regular group of J which contains b. Then 


g(S,s) = ©, if m(b,s+) = ~ orif(J,s S |x|) = (J,s < |2}); 


If m(b, s+) # ~ and (j,s S |x!) ¥ (J, s< jx }), then 
g{S, s) = m(b < J, s+), if m(b < J,s) = &, 


and if m(b < J, s) # ~, theng(S,s); = m(b < J, s+), and g(S, s),, is the 
infinite part of the numbers of genus s. 

Proof. As a consequence of Theorems 6.6 and 6.5c there exists one and only 
one regular subgroup S of J such that g(S, s) has the value given in the Corollary 
6.7. This regular subgroup S contains the element b, since the elements b; of 
a primitive set with sum 6 are by Corollary 5.7 and by Theorem 6.5a contained 
in S. If finally 7 is a regular subgroup of J which contains b, then S < T by 
Lemma 6.2 and Theorem 6.5a, i.e., S is the smallest regular subgroup of J 
which contains b. 

Coro.uary 6.8. The elements b and b’ of the separable group J are contained 
in the same regular subgroups of J if, and only if, m(b, s) = m(b’, s) and m(b, s+) 
= m(b’, s+) for every genus s. 

For the smallest regular subgroup of J which contains b depends by Corollary 
6.7 only on the values of m(b, s) and m(b, s+). 

Thus the type of an element is generally not completely determined by the 
regular subgroups which contain the element. 

Coro.tuary 6.9. If J is a separable group, the following statements are 
equivalent : 

(a) Two elements of J are isotype in J if, and only if, they are contained in the 
same regular subgroups of J. 

(b) There does not exist a genus w with the following properties: 

(b1) wis a singular genus for J; 
(b2) there exists a genus s such thats < wand (J,s < |x|) ¥(J,s S/|2)); 
(b3) there exists an ordinary positive integer n which is relatively prime to the 
infinite part of the numbers of genus w such that Z(J, n, w) contains elements 
of order n which are not conjugate. 

This is a consequence of Corollary 6.8, Lemma 5.8, Corollary 5.10b and 
Theorem 5.12. 

Noration. If b # 0 is an element of the separable group J, s a genus, then 
q(b, s)’ is either 1 or 2 and q(b, s)’ = 2 tf, and only if, 
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(1) s is singular for J and 2 relatively prime to the infinite part of the numbers 
of genus Ss; 

(2) m(b, s) # m(b, s+) and either 

(21) m(b, s) # ~&, q(b, S) even or 
(22) m(b, s) = @. 

THEOREM 6.10. Suppose that b # 0 is an element of the separable group J, 
that C is the smallest characteristic subgroup of J containing b, and that R is the 
greatest regular subgroup of J contained in C. 

(a) C = R if, and only if, at most one of the numbers q(b, s)’ = 2. 

(b) If R < C, then 

(b1) C/R is a cyclic group of order 2 (and consists therefore of the elements 
Rand R + b); 

(b2) g(R < J,s) = ~,if(J,s<|x|) =(J,s S| 2]) orifm(b,s+) = ~, 
g(R < J, s); = q(b, s)'’m(b, s+); and 

g(R < J,s),, is the infinite part of the numbers of genus s, if (J, s < |x|) # 
(J,s S |2]|) and m(b, s+) # ~. 

Proof. Since q(b, s)’ / q(b, s), if m(b, s) # «, there exists by Theorem 6.6 
one and by Theorem 6.5c¢ only one regular group R such that the values of 
g(R, s) are those given in (b2). By Lemma 6.2, R < C. 2b is an element of 
R by Lemma 5.8 and Theorem 6.5a. 

If every g(b, s)’ = 1, then R is by Corollary 6.7 the smallest regular subgroup 
of J containing b, and therefore R = C is a regular subgroup of J. 

If at least one of the q(b, s)’ = 2, then b is not an element of R, as follows from 
Lemma 6.2 and Lemma 5.7, 5.8. But 2b is an element of R and the elements 
of R and R + b form therefore a group C’ which contains b and satisfies R < 
C’ < C. C’/Risa eyclic group of order 2. 

Let b;, --- , db be a primitive set with sum 6 and satisfying the conditions 
of Lemma 5.8. If s is the genus of b; , then b; is an element of R if, and only if, 
q(b,s)’ = 1. If q(b, |b; |)’ = 2forl Si Sh, thenl Sh S kand 


h 
b; mod R. 
i=1 


b=b' = 


Let a be a proper automorphism of J. Since | b; | is for 1 S i S ha singular 
genus, it follows from Lemma 5.6 that bia = ri(b; + b;), where r; is a rational 
number # 0 whose numerator and denominator are divisors of the infinite 
part of the numbers of genus | b; | and where b; is an element of (J, | bi | < | 2 |) 
satisfying m(b,) | m(b;). 

Now s < t implies that q(b, t)’m(b, t+), / m(b, s+); (if q(b, t)’ is defined), 
since the 2-value of m(b, t+) is smaller than the 2-value of m(b, t) and therefore 
than the 2-value of m(b, s+), if q(b, t)’ = 2. Furthermore, b; is the sum of a 
primitive set bi , --- and, since | b; | < | bj; |, it follows that 


q(b, | bi; |)’m(b, | bis | +)r | m(b, | bs | +), | m(bi) | m(b;) | m(b;)), 


. ye 
i.e., b;; and consequently 6; is an element of R. 
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If r; = r;r;’, then r’, r” are both odd (if relatively prime) and since 2b; and 
(by Corollary 5.3) 2r;*b; are elements of R, it follows that b; — bsa = 0 mod R. 
Since F# is a characteristic subgroup, C’ is therefore also a characteristic sub- 
group, and since b is an element of C’ S C, it follows that C’ = C. 

If exactly one q(b, s)’ = 2, i.e., h = 1, then b is the sum of a primitive set in 
J which is contained in C. C is therefore generated by the primitive elements 
in J which are contained in C, i.e., C is by Theorem 6.6 (and its proof) regular. 

If 2 < h (in the above notation), then b; and be are not contained in R and 
therefore not in C, which is singular by (6.2.1). 

Coro.uary 6.11. The elements b and b’ of the separable group J are contained 
in the same characteristic subgroups of J if, and only if, m(b, s) = m(b’, s) and 
m(b, s+) = m(b’, s+) for every genus §s, 7.e., if, and only if, b and b’ are contained 
in the same regular subgroups of J. 

Proof. If b and b’ are contained in the same characteristic subgroups of J, 
then they are contained in the same regular subgroups of J. If they are con- 
tained in the same regular subgroups, then m(b, s) = m(b’, s) and m(b, s+) = 
m(b’, s+-) for every genus s by Corollary 6.8. If b and b’ satisfy these identities, 
then it follows from Theorem 6.10, since q(b, s)’ = q(b’, s)’, that R(b) = R(0’), 
where R(u) is the greatest regular subgroup of the smallest characteristic sub- 
group C(u) of J which contains u. If furthermore b,, --- , b; is a primitive 
set with sum b and'bj,, --- , bj a primitive set with sum b’, b; not contained 
in R(b), then 2b; is contained in R(b), q(b, | b; |)’ = 2and therefore q(b’, | b; |)’ = 
2, i.e., | bj | = | bs | and b; — b; = O mod R, ice., C(b) = C(b’). Note that C(d) 
is completely determined by R(b) and the set of genera s such that g(b, s)’ = 2. 

Coro.uary 6.12. There exist singular subgroups of the separable group J if, 
and only if, there exist at least two different singular genera s such that the 2-values 
of the numbers of genus s are finite. 

This is a consequence of Theorem 6.10, since the join of regular subgroups 
is a regular subgroup, and since every characteristic subgroup S is the join of 
the groups C(s) with sin S. 

Survey of the singular subgroups. If C is any characteristic subgroup of the 
separable group J, then C contains a greatest regular subgroup R(C) and is 
contained in a smallest regular subgroup S(C). By Lemma 6.2 and Theorem 
6.10 all the elements # 0 of S(C)/C and C/R(C) are of order 2. 

Suppose now that C is singular and b* + 0 an element of C* = C/R(C). 
Then denote by S(b*) the set of all the genera s such that q(b, s)' = 2 for every 
element b in the class b*. S(b*) contains at least two different genera and there 
exists in b* an element bp such that m(bo , s) # m(bo , s+) if, and only if, s is 
contained in S(b*) and q(bp, s)’ = 2, if s is contained in S(b*). S(b* + c*) 
contains exactly those genera which are contained either in S(6*) or in S(c*) 
but not in both these sets. 

Thus in the set T = T(C) of sets S(6*) with 6* in C* an addition has been 
defined, such that T becomes an abelian group (S(0) = 0) and such that the 
correspondence S(b*) between C* and T is an isomorphism. 
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Finally if s is an element of S(b*), then s is a singular genus for J and the 
2-value of g(R(C), s) is positive and finite. 

If conversely C* is an abelian group whose elements ~ 0 are of order 2, R a 
regular subgroup of the separable group J and S(b*) a function of the elements 
of C* which has all the mentioned properties, then there exists a characteristic 
subgroup C of J such that R = R(C), C* = C/R(C) and S(b*) is the above iso- 
morphism between C* and T(C). 

Finally C is uniquely determined by R(C) and S(b*). 

Characteristic decompositions. A direct decomposition of a group J is a char- 
acteristic decomposition, if all the summands are characteristic subgroups. 
Every characteristic direct summand is regular.” 

The group J is characteristic irreducible, if J is a summand of every charac- 
teristic decomposition. It may happen that J is characteristic irreducible 
and that there exists a characteristic direct summand + 0, + J of J. There 
exists at most one characteristic decomposition in characteristic irreducible 
direct summands.” 

Suppose now that the group J is separable and partially reducible. The 
set P(./) of all the genera s with (J, s S | 2 |) ¥ (J, s < | x |) can be decomposed 
into sets S such that 

every genus is contained in exactly one set S; 

every S contains genera; 

ifs < t are genera in P(./), then they are contained in the same S$; 

every S is as small as possible. 

This decomposition of P(./) is unique. Let now J(S) be the join of the sub- 
groups (J, s < |x|) withsinS. Then J is the direct sum of the groups J(S) 
and this decomposition is a characteristic decomposition into characteristic 
irreducible, direct summands. 


7. Types of subgroups. The following concepts and facts concerning abelian 
groups without elements of infinite order will be used in the course of this section. 

If F is an abelian group without elements of infinite order, then the order of F 
is the |.c.m. of the orders of its finite subgroups; the rank m(F) of F is the small- 
est number m such that every finite subgroup of F can be decomposed into a 
direct sum of at most m cyclic groups. (Only groups of finite rank will be 
considered. ) 

If the rank m(F) of the abelian group F without elements of infinite order is 
finite, there exists a canonical decomposition of F into a direct sum of m(F) 
groups F; of rank 1 such that the order of F; is a divisor of the order of Fi4.. 
The orders of the summands F; of a canonical decomposition do not depend 
on the particular canonical decomposition and determine the structure of F 
completely. They are the invariants m(F), --+ , Mnce)(F) of F. 

If F is in particular a finite group, then the groups F; are cyclic groups and 


13 See R. Baer, The decomposition of abelian groups into direct summands, Quart. Jour. 
of Math., (2), vol. 6 (1935), pp. 222-232. 








ene 


vos PERT 


& 








Sees TEI 


ed, 





ABELIAN GROUPS WITHOUT ELEMENTS OF FINITE ORDER 97 


the subset B of F is a basis of F, if the cyclic subgroups of F, generated by the 
elements in B, form a canonical decomposition of F. 
If F is of finite rank and S a subgroup of F, then S and F/S are of finite rank. 
If A is any abelian group, the subset F(A) of all the elements of finite order 
in A is a subgroup of A and A/F(A) does not contain elements ~ 0 of finite order. 
If J is an abelian group without elements ~ 0 of finite order, S a subgroup of J, 
then the closed subgroup S of J which is generated by S satisfies F(J/S) = S/S. 
If S is of finite rank, then F(J/S) is of finite rank and m(F(J/S)) S r(S). 
If S is a subgroup of the (abelian) group J (without elements ~ 0 of finite 
order), then the defect of an element b # 0 of S in J is 


d(ib < S < J) = m(b < J)im(b < 8S). 


An element b # 0 of S has the same genus in S and in J if, and only if, its defect 
is an ordinary integer. 

If Rk’ * 0 is a subgroup of the rational group R, all the elements ~ 0 of R’ 
have the same defect (R:R’). Then (R:R’) is the order of the group R/R’ of 
rank 1 and R = (R:R’)'R’. 

Lemma 7.1. (a) The completely reducible group J of finite rank is finite mod 
its subgroup S if, and only if, 

(a’) r(S) = r(J), 
(a’’) the elements + 0 of S have the same genus in S and in J. 

(b) If S is a subgroup of the direct sum J of a finite number of isomorphic 
rational groups (all of genus s), and if F = J/S is finite, there exist decompositions: 


r(J) m(F) r(J) 
b+) J=2R, S=Pm(P)R+8', S'= DR, 
i=1 i~1 t=m(F)+1 


where the R; are groups of genus s, S’ is a greatest closed subgroup of J contained 
in S, m(F)R; is for 1 S i S m(F) the intersection of S and R; , m(F) the finite 
rank and m;(F) the finite invariants of the finite group F = J/S,and m(F) S r(J/). 

Proof. If J is a completely reducible group of finite rank and J/S is finite, 
then r(J) = r(S), since every element of J has finite order mod S. If b # 0 
is an element of S, b the closed subgroup of J, generated by b, 6’ the intersection 
of S and b, then 6/6’ represents exactly a subgroup of J/S, i.e., (b:b’) is finite 
and b has the same genus in S and in J. 

If, conversely, (a’) and (a’’) are satisfied, J is the direct sum of the rational 
groups J;, J; the intersection of S and J;, S’ the direct sum of the groups J; , 
then S’ < S, (J;:J;) finite, i.e., J/S’ is finite and therefore J/S = (J/S’), 
(S/S’) is finite. 

Suppose now that J is a direct sum of a finite number of isomorphic rational 
groups of genus s and that J/S = F is finite. There exists a greatest closed 
subgroup S’ of J such that S’ < S._ S’ is by Corollary 3.5 a direct summand of 
J, since r(J) is finite and S’ is closed. If J = S’ + J”, then S = S’ + 8”, 
where S” is the intersection of Sand J”. F = J/S and J”/S” are isomorphic 
finite groups and there exists therefore a basis B of J” mod 8S”. 

(7.1.1) Every basis B of J” mod S” is a greatest independent subset of J”. 
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For if >> eb = 0 with integer, relatively prime coefficients c , then ob = 0 
binB 
mod S’’, and therefore c, = 0 mod m(F), i.e., c = 0 and B is independent. 

In order to prove that B contains r(J’’) elements, denote by B the closed 
subgroup of J’’ generated by B. Since every class of J’’/S” contains elements 
of B, there exists corresponding to every element u # 0 of J” an element s(u) 
such that s(u) = 0 mod 8”, u = s(u) mod B. Since 0 is the only closed sub- 
group of J” contained in S”, there exists corresponding to every element s # 0 
of S” a positive integer w(s) such that w(s)”'s exists in J, but not in S”. 
There exists, therefore, corresponding to every element v of J” a chain s;, t; 
satisfying 


v=, 8; 


0, if t; = 0; tins = 0, if s; = 0, 
5, = s(t;), if t; = 0; tisa = w(s;) 's;, if 8; # 0. 


k-1 
Then v = [] w(s,) s, mod B, if s; = 0 for i < k, and therefore v = 0 mod B, 
i=0 


if at least one s, = 0. 

If none of the s; = 0, none of the numbers w(s;) is relatively prime to the 
finite order f of the finite group F and there exists therefore a prime number 
p|/f (p therefore relatively prime to the infinite parts of the multiplicities of 
elements in J’’) such that 


v = p'z;mod B 


has for every i a solution z; in J’. Since B is by Corollary 3.5 a direct sum- 
mand of J’, this implies that v = 0 mod B, ie., J” = B and this completes 
the proof of (7.1.1). 

Let now bi, --- , bm form a basis of J” mod 8S” such that m = m(F) and 
m,(F) is the order of b; mod S’’.. Denote by B; the closed subgroup of J gener- 
ated by bk, ---,b;. Then B, is a rational group, B, = J” by (7.1.1), and 
Bin = B; + Rin by Corollary 3.5. Then bi = vier + visa With uiss in B;, 
vis. in R;,, and the elements v; = b; , v2, --- , Um form a basis of J” which is also 
a basis of J’ mod S”. This shows that the groups R; together with S’ define 
a decomposition of J which meets the requirements of (b+). 

Remark. The following example shows that it is impossible to omit in Lemma 
7.1b the assumption that all the elements # 0 of J have the same genus in J. 

A basis of the group J is formed by the elements b’, b’”’ with m(b’ < J) = 1, 
m(b’’ < J) = 2” and the subgroup S is generated by the elements 3b’ + b”, 
2°'3b”. Then J/S is a cyclic group of order 3, but there does not exist a de- 
composition (b+) of J, since a multiple of b’” is contained in every basis of J. 

DEFINITION 7.2. The direct decomposition 


Jaz Jo+T' 


of the group J is a complete reduction of the subgroup S in J, if the groups J, are 
rational and S = >> S, for S, = intersection of S and Jy . 
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If there exists a complete reduction of the subgroup S in J, then S is com- 
pletely reducible in J. 

If the subgroup S of J is completely reducible in J, then S is completely 
reducible. The converse is not true, since e.g., rational subgroups S of separable 
groups J are completely reducible in them if, and only if, all the elements + 0 in 
S are primitive in J. It is a consequence of Lemma 7.1b that S is completely 
reducible in J, if J/S is finite, r(J) = r(S), J is completely reducible and all 
the elements ~ 0 in J have the same genus in J. 

Coro.iary 7.3. Suppose that J and its subgroup S have the same finite rank. 

(a) S ts completely reducible in J, all the elements # 0 of J have the same genus 
g in J and all the elements ¥ 0 of S have the same genus s (S g) in S if, and only 
if, there exists a subgroup T of J such that 

(1) SST sJ, 

(2) all the elements t 0 of T have the same defect d = d(it < T < J)inJ 
(and therefore J = dT), 

(3) T is a direct sum of isomorphic rational groups, 

(4) all the elements ¥ 0 of S have the same genus in S and in T. 

(b) If the subgroup S of J satisfies the conditions of (a), then there exist de- 
compositions into rational groups: 


r(J) 


r(J) 
(b+) J=aD Ji, S=) 5S, 
t=1 fe | 


where S; = intersection of Sand J;,\ Ji | = g, | Si| = s, (Je: Si) = m(J/S) 
i-th invariant of the group J/S without elements of infinite order for 1 S i 
m(J/S). 

Note that either all the numbers (J;:S,;) are ordinary integers—the case dealt 
with in Lemma 7.1—or all the (J;:S;) have the same genus and m(J/S) = 
r(J) = r(S). 

Proof. Suppose first that S is completely reducible in J, that all the elements 
~ 0 of J have the same genus g, and that all the elements # 0 of S have the 
same genus sin S. Then there exist decompositions into rational groups 


IA I 


r(J) r(J) 
J=DR, S=DR, |Rl=g, (|Ril=s, 
t=} =i 


where R’, is the intersection of S and R; . 
Denote by d the g.c.d. of all the djs < S < J) with s ¥ 0 in S and by d* 


r(J) 
the g.c.d. of the numbers (R;:R:). If s ¥ 0 is an element of S, then s = > s 
i=1 


with s;in Rj. Then m(s < S) is the g.c.d. of the numbers m(s; < R;) and m(s < 
J) is the g.c.d. of the numbers m(s; < R,). If s; ¥ 0, there exists a uniquely 
determined ordinary positive integer h; which is relatively prime to the infinite 
part of the numbers of genus g such that 


m(s;i < J) = him(s < J) 
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and a uniquely determined ordinary positive integer k; which is relatively prime 
to the infinite part of the numbers of genus s such that 

m(s; < S) = kym(s < 8S). 
Then hid(s < S < J) = k(R;:R}), and since the numbers h; with s; + 0 are 
relatively prime, and also the numbers k; with s; # 0 are relatively prime, it 
follows that d(s < S < J) is the g.c.d. of the numbers (R;:R;) with s; ¥ 0, 
i.e., that 


d = d*. 


Since r(./) is finite and all the numbers (R;:R;) have the same genus, it fol- 
lows that every n; = (R;:R{)id is an ordinary integer. Denote now by RY’ 
the subgroup of R; which satisfies n:R = R;. Then (RiRY) = d, (RY:R)) = 
n; and the direct sum 


satisfies (1)—(4). 
Suppose secondly that there exists a subgroup 7’ of J which satisfies (1)-(4). 
Then there exists by Lemma 7.1 a decomposition 


r(J) r(J) 
T= 7;, S=DS, 
$=} i=l 
such that S; is the intersection of S and T;, | S;| = | 7;| = s and the numbers 
(7;:S;) with 1 S 7 S m(T/S) are the invariants of the finite group 77/8. 


r(J) 


If J; = d'T; is the closed subgroup of J generated by T;, then J = >> Ji, 
63, 


| Ji | = t and (J;:S;) = d(T;:S)), i.e., these decompositions of J and S are 
decompositions (b+). 

If finally there exist decompositions (b+) of J and S, then S is completely 
reducible in J, all the elements ¥ 0 of J have the same genus in J and all the 
elements * 0 of S have the same genus in S. 

If J and its subgroup S have the same finite rank; if J is completely reducible; 
if all the elements # 0 of J have the same genus in J; and if all the elements 
~ 0 of S have the same genus in S, then it can happen that S is not completely 
reducible in J. This is shown by 

Example 7.4. Let J be a group with a basis b’, b” such that m(b’ < J) = 
m(b’’ < J) = p” for a given prime number p. Choose the ordinary integers ¢; 


in such a way that ¢ = +» c;p' is an irrational p-adic number and 0 S ¢; < p. 
i=0 
Denote by S the subgroup of J, generated by the elements b’, b”, p ‘(b’ + 
¢--3 
os c;p’b”) fori = 1, 2,---. Then all the multiplicities m(s < S) of elements 


i=0 
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s ~ Oin S are ordinary integers, S/pS is a cyclic group of order p and S is 
therefore not even completely reducible (in S)."* 

Coro.uary 7.5. Suppose that the group J and its subgroup S satisfy: 

(1) S and J have the same finite rank; 

(2) J is a direct sum of isomorphic rational groups (of genus g); 

(3) The g.c.d. d(S < J) of the defects d(s < S < J) of the elements s # 0 of 
S in J has the same genus as these defects; 

(4) dS < J), = 1. 

Then S is completely reducible in J and all the elements # 0 of S have the same 
genus in S. 

Proof. There exists by (3) to every element s + 0 of S an ordinary integer 
h(s) such that d(s < S < J) = h(s)d(S < J). Therefore m(s < J)jd(S < J) = 
h(s)m(s < S) by (4) and all the elements # 0 of S have consequently by (2) 
the same genus s in S. 

If U is the subgroup of J which satisfies U = d(S < J)"S and S = d(S < 
J)U by (4), then all the elements # 0 of U have the same genus in U and in J. 
There exists therefore by Lemma 7.1 decompositions 


r\J) r(J) 
T= Ji, U= DU, 
i= i=l 
r(J) 
U; = intersection of U and J;. Thus S = d(S < J)U = }0 d(S < J)U; and 
o=4 
d(S < J)U; = intersection of S and J;, i.e., S is completely reducible in J. 
Noration. If S is a subset of the group J, then S is the closed subgroup of J, 
generated by S. 
If s S tare genera and S a subgroup of the group J, then 
(S < J,t,s) = intersection of (J,t |x < J|) and (S,s <= |x< S8)), 
(S < J,t, s+) = intersection of (J,t S |x < J|) and (S,s <|x2< 8)), 
(S < J,t,s)* join of (S < J, t, s) and (S < J, t, s+), 
n(S < J, t, s) r((S < J, t, s)/(S < J, t,s+)), 
N(S < J,t,s) = (S < J,t,s)/(S < J, t, s)*. 
These notations and Corollary 7.3 imply the following statement. 
(7.6) If the formulas 


Ja) B+J' S => &, 











where S, is both a rational group and the intersection of S and 8, , give a complete 
reduction of Sin J, and if fors St 


144 The group S belongs to a class of groups which has been discussed recently by A. 
Kurosch, Ann. of Math., vol. 38 (1937), pp. 175-204. 

18 Note that the group J and its subgroup S which have been considered as Example 7.4 
satisfy (1)-(3). 
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S(t, s) is the direct sum of those groups S, which satisfy s = | S,| andt = | 8, |, 
then 


(S<J,t,s)= >> Dd Swv,n), (S < J,t,s+) = © Dd S(v, u), 


tsv ssu tsv s<u 


n(S < J,t,s) = r(S(t, s)) = r(S(t,s)), N(S < J,t,s) = S(t, s)/S(t, s). 





If the ranks n(S < J, t, s) are finite, there exists a decomposition 
Jud B+ J’, S = 2 R, 


such that the rational group R, is the intersection of S and R, , S(t, s) is the direct 
sum of the groups R, with s = | R,|, t = | R. |, and the numbers (R,:R,) ¥ 1 
with s = | R,|,t = | R, | are exactly the invariants of N(S < J, t, s). 
Coro.uary 7.7. Suppose that S is.a subgroup of finite rank of the separable 
group J. Then S is completely reducible in J if, and only if, there exists a direct 


decomposition S = : Sti such that 
t i 


(a) S,, is completely reducible in Sx: ; 
(b) every element ¥ 0 of St = > Sti is a primitive element of genus t in J; 


©) & = + Bu. 


Proof. If S is completely reducible in J, then S is a completely reducible 
direct summand of J and the groups S(t, s) of (7.6) show the necessity of the 
condition. If conversely the condition is satisfied, then every S, is by (b) and 
Theorem 4.2 a direct summand of J. S is therefore the direct sum of the groups 
S, and by Theorem 4.2 a direct summand of J. S is completely reducible in J 
as a consequence of this fact and of (a). and (c). 

THEOREM 7.8. Suppose that the subgroups S and T are both completely re- 
ducible in the group J, and that all the ranks n(S < J, t, s) are finite. Then S 
and T are isotype in J if, and only if, 

(a) n(S < J,t,s) = n(T < J, t, s) for any two generas & t; 

(b) N(S < J, t, s) and N(T < J, t, s) are isomorphic for any two genera 
s st; 

(c) J/S and J/T are isomorphic. 

Proof. The necessity of the conditions is obvious. If the conditions (a) to (¢) 
are satisfied, there exist by (7.6) direct decompositions 

Jad b+ = 3 F, 4+ 2", S= > 8., T=?) T, 
such that the rational groups S, are the intersections of S and 8S, , the rational 
groups 7, the intersections of T and T,, the numbers (S,:S,) # 1 are the 
invariants of N(S < J, | 8, |, | S, |), the numbers (7,:7,) # 1 are the invari- 
ants of N(T < J,| T.|,| T» |). (Note that by (a) also the ranks n(T’ < J, t, s) 
are finite.) 

By (a), (b) and (7.6) it is possible to denote the groups S, in such a way that 


13] =| 7.1, [S| =| Tol, (B.:8,) = (Te: Ts) 
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for every v. Since J’ and (J/S)/F(J/S), J” and (J/T)/F(J/T) are isomorphic, 
(c) implies that J’ and J” are isomorphic. Therefore there exists a proper auto- 
morphism of J which maps 8, upon 7, , J’ upon J”, and since this automorphism 
also maps S, upon 7’, , i.e., S upon 7, S and T are isotype in J. 

Coro.Liary 7.9. Suppose that S and T are subgroups of finite rank of the 
separable group J, and that S and T are both completely reducible in J. Then S 
and T are isotype in J if, and only if, for any two genera s S t 

(a) n(S < J, t,s) = n(T < J,t,s); 

(b) N(S < J, t,s) and N(T < J, t, s) are isomorphic. 

Proof. Suppose that the conditions (a) and (b) are satisfied. By Corollary 
4.3 J has a completely reducible direct summand D of finite rank which con- 
tains both S and 7. Then it follows from (a), (b) and the finiteness of the 
ranks n that D/S and D/T are isomorphic, and S and T are therefore isotype in 
J by Theorem 7.8. 

If the ranks n(S < J, t, s) are finite, the ranks n and the groups N satisfy 

(i) m(N(S < J, t, s)) S n(S < J,t, s), and the equality holds, ifs < t; 

(ii) D> n(S < J, t,s) < r(J(t)*); 

t 


(ili) to every invariant m;(N(S < J, t, s)) there exists a number ¢ of genus t 
and a number s of genus s such that t;s = m,(N(S < J, t,s)). 
If >> n(S < J, t, s) is finite and J is separable, the above conditions are also 
ts 


sufficient for the existence of a subgroup S of J with these given ranks n and 
groups N. 

The following particular cases of the Corollary 7.9 may be mentioned. Sup- 
pose that J is a separable group, that S and T are subgroups of finite rank, 
that all the elements + 0 of S and of T are primitive elements of the given 
genus g in J, that either all the elements + 0 in S and in T have in S and T 
respectively the genus g; or all the elements # 0 in S and in T have in S and T 
the genus s(< g), and that S and T are completely reducible in J. Then S 
and 7 are isotype in J if, and only if, 

(a) r(S) = r(T), 

(b) F(J/S) and F(J/T) are isomorphic. 

Since r(J) = r(S) + r((J/S)/F(J/S)) = r(S) + r(J/8), we have also the 
following result. If the subgroups S and T of the separable group J satisfy the 
above assumptions, and if the rank of J is finite (and J therefore completely 
reducible), then S and T are isotype in J if, and only if, J/S and J/T are 
isomorphic. 

If finally J is a direct sum of a finite number of infinite cyclic groups, and S is 
any subgroup of J, then J/S is finite and the type of S in J is completely deter- 
mined by the structure of J/S. 

The following remark may give an idea of what happens if subgroups are 
considered which are not completely reducible in the whole group. Let J be a 
complete group. Then the type of the subgroup S of J in J is completely 
determined by the structure of S and the structure of J/S. 
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Chapter IV. Reducibility and separability 


8. Decomposition into isomorphic rational groups. 

Lemma 8.1. Suppose that S is a subgroup of the group J and that all the elements 
of the class b* ¥ 0 of J/S have the same genus sin J. Then 

(a) Ss (J,s S |2)). 

(b) s is the genus of b* in J/S if, and only if, there exists in b* an element b 
such that m(b < J) = m(b < J/S). 

Proof. If s # 0 is an element of S, b an element of b*, then b and b + s 
have both genus s and s is therefore a divisor of the genus of s. 

If there exists in b* an element b such that m(b < J) = m(b < J/S), then 
b and b* have the same multiplicity and therefore the same genus. 

Suppose now that s is the genus of b* and b’ an element of J in b*. Then 
m(b’ < J)/ m(b! < J/S) = m(b* < J/S) and there exists an ordinary positive 
integer h which is relatively prime to m(b’),, such that hm(b’ < J) = m(b! < 
J/S). Denote by h’ the greatest divisor of m(b’ < J/S) such that the same 
prime numbers divide h and h’. Then h’ is an ordinary positive integer and 
there exists an element z in J such that h’z = b’ mod S._ There exists, as above, 
an ordinary positive integer k which is relatively prime to m(h’z),, such that 
km(h'z < J) = m(h’z < J/S) = m(b’ < J/S) = m. Since m(h'z) = h'm(z), 
moh’ exists and is relatively prime to h’ (by its definition) and k and h are 
therefore relatively prime. There exist ordinary integers k’, h’”’ such that 
kk” + hh” = 1 and 


b = h”/'hb’ + k’kh’'z = 6’ mod S 


satisfies m(b < J) / m/ g.c.d. h”hm(b! < J), k"kmth’z < J)/m(b < J), ie., 
m(b < J) = m(b < J/S). 

Coro.uary 8.2. Suppose that S < J and that all the elements of J which are 
not contained in S have the same genus sin J. Then 

(a) (J,s<|r|) Ss S<J=(J,8s € |z)). 

(b) All the elements ¥ 0 of J/S have the genus s if, and only if, there exists 
corresponding to every element b of J an element b’ of J such that 


b = b’ mod S, m(b’ < J) = mb! < J/S)(= mb < J/S)). 


This follows from Lemma 8.1 and the fact that every element of (J, s < | x |) 
is the sum of elements of S. 

DeFIniTION 8.3. The class 1; consists of all countable groups. If v is a finite 
or infinite ordinal greater than 1, then T, consists of all groups J satisfying 

(a) J does not belong to a class T, with uw < v; 

(b) There exists a closed subgroup S of finite rank of J such that J/S is a direct 
sum of groups which belong to the join TY” of the classes T, with w < v. 

The additive group P of the integer p-adic numbers belongs to the second 
class and the direct sum of an infinity of such groups P belongs to the third class. 
The additive group of all the sequences of integers does not belong to any of 
these classes, as will be proved in the section on vector-groups. 

THEOREM 8.4. Assume that all the elements of the group J which are not con- 
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tained in the subgroup S of J have the same genus's. Then J is the direct sum of S 
and rational groups (of genus s) if, and only if, 

(a) S is a closed subgroup of J; 

(b) the elements ¥ 0 of J/T have the genus s in J/T for every closed subgroup 
T of J such that S = T and r(T/S) is finite; 

(c) J/S belongs to a class 1, . 

Proof. A. Suppose that J = S + R, where R is completely reducible. 
Then S is a closed subgroup of J. If S < T < J, then T = S + T} where 
T’ is the intersection of T and R. If T is a closed subgroup of J and r(7’/S) 
is finite, then 7” is a closed subgroup of R and r(7”) is finite. 7” is by Corollary 
4.4 a direct summand of R, i.e., R = T’ + R’ and consequently all the elements 
~ 0 of J/T are of genus s in J/T, since J/T and R’ are isomorphic. Since 
finally R and J/S are isomorphic, J/S belongs either to T, or to T: . 

B. Suppose now that the conditions (a) to (c) are satisfied by the subgroup 
S of J. 

1. r(J/S) = 1. 

Then there exists by Corollary 8.2 an element b in J such that b # 0 mod S, 
m(b < J) = m(b < J/S). The elements of the closed subgroup 6 of J, gen- 
erated by b, represent therefore exactly the classes of J/S and consequently 
J=S+6. 

2. J/S is countable. 

Then there exist closed subgroups 7; of J for —1 S i < r(J/S) such that 
S = T.1, Tian < T;, r(Ti/Tin) = 1, and such that J is the join of the groups 
T; Gf r(J/S) = m + 1 is finite, then J = T,,). Since r(7;/S) = i + 1, it 
follows from 1 that 7; = T;. + R;for —1 <i < r(J/S), where R; is a rational 
group of genus s._ Since ./ is the join of the groups 77; , it follows that 

J=S+ DR. 
0<i<r(s/s) 

3. J/S belongs to the class I, . 

Since for y = 1 the theorem has been proved in 2, it can be assumed that the 
theorem holds true for subgroups S’ of groups J’ which satisfy (a), (b) and such 
that J’/S’ belongs to I”. 

Since J/S belongs to I, , there exists a closed subgroup W of J such that 
S < W,7r(W/S) is finite and J* = J/W is the direct sum of groups J, belong- 
ing tol’. Let J, be the subgroup of J which contains W and satisfies J,/W = 
Jz. Then the subgroup W of J, satisfies the conditions (a), (b), and since J? 
belongs to I’, J, = W + W,, where W, is completely reducible. Suppose now 
that b is an element of W, that the elements b; belong to different groups W, 
and that b + >> b; = 0. Then every b; belongs to W, since J* is the direct 


sum of the groups Fl i.e., b; = 0, since 0 is the intersection of W and W,, i.e., 
b = 0. Thus the groups W, W, are independent and, since every class of J/W 
contains elements of the sum of the groups W, , it follows that 


J=W+>W. 
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Since r(W/S) is finite, W/S is countable and it follows from the case, treated 
in 2, that W = S + V, with V completely reducible, and consequently J = 
S+R,withhR =V+ DW, completely reducible. 


Corouiary 8.5. The group J is a direct sum of isomorphic rational groups 
(of genus s), if, and only if, 

(a) the elements # 0 of J/T all have the same genus s for every closed subgroup 
T of finite rank of J; 

(b) J belongs toa class 1, . 

This follows from Theorem 8.4 by choosing S = 0. 

Remarks. 1. If J is complete, then J is completely reducible and in this 
case the conditions (a) and (b) are not needed. 

2. If J is a closed subgroup of the group P of all the integer p-adic numbers, 
then J/pJ is a group of order p. Since all the elements # 0 in S have the same 
genus, J is directly irreducible, and satisfies condition (b) and a weaker form of 
condition (a). This shows that condition (a) cannot be omitted, since there 
exist closed subgroups J of every finite rank in P. 

3. The additive group of all the sequences of integers furnishes an example 
of a group, satisfying condition (a), but not condition (b), as will be proved in 
the section on vector-groups. 

The following criterion provides a handy method for constructing a basis in 
some simple cases. 

Coro.uary 8.6. Suppose that n + 1 is the product of a finite number of differ- 
ent prime numbers, that n is the genus of those numbers whose infinite part is 
divisible by all prime numbers which are relatively prime to n and that J is a direct 
sum of a finite number of rational groups of genusn. Then the subset B of J isa 
basis of J if, and only if, B consists of multiples of the elements of a basis of J 
mod nJ. 

Proof. If first B is a basis of J, the elements m(b < J);'b with b in B form 
a basis of J mod nJ. Assume conversely that the elements in B are multiples 
of the elements of the basis B’ of J mod nJ. B’ is an independent subset of J, 
since every relation between elements of B’ implies a relation mod nJ. Denote 
now by B the closed subgroup of J, generated by B (or by B’). If w is any 
element in J, there exists an element w’ in B such that w = w’ mod nJ. The 
congruence nz = w mod B has therefore a solution z in J for every element w, 
ie., J/B = n(J/B). Corollary 8.5 implies therefore that J = B, ie., that 
B’ and B are greatest independent subsets of J. Consequently there exist 
corresponding to every element w # 0 in J relatively prime integers k ¥ 0, 
k(b) such that 

kw = DO k(byb. 


bin B’ 


k and n are relatively prime, since B’ is a basis of J mod nJ and therefore a 

oe ‘oe = 
common divisor of k and n divides every k(b). Hence every element k k(b)b 
exists in J, i.e., B’ and therefore B is a basis of J. 
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Note that the finiteness of r(J) is a consequence of the validity of the above 
criterion and of the other assumptions of Corollary 8.6. 

Coro.uary 8.7. Suppose that all the elements of the group J which are not 
contained in the subgroup S of J have the same genus sin J. Then the following 
three statements are equivalent. 

(a) J is a direct sum of S and a completely reducible group. 

(b) J/S is a direct sum of rational groups of genus s. 

(ec) J/S ts separable, belongs to a class T, and all the elements ¥ 0 in J/S have 
the genus s. 

Proof. If J is the direct sum of S and the completely reducible group R, 
then # is a direct sum of groups of genus s and R and J/S are isomorphic, i.e., 
J/S is a direct sum of rational groups of genus s. If J/S is a direct sum of 
rational groups of genus s, then J/S is separable, belongs to T, or to T, and all 
its elements ~ 0 have the genus s. If finally J/S is separable, belongs to a 
class I’, and all the elements ~ 0 of J/S have the genus s, it follows from Corol- 
lary 4.4 that the conditions (a) to (c) of Theorem 8.4 are satisfied by J and its 
subgroup S, and J is therefore the direct sum of S and a completely reducible 
group. 

Remark. Since there exist direct irreducible groups of rank 2 whose elements 
~ 0 all have the same genus, the words “of genus s” cannot be omitted in (b). 

THEOREM 8.8. If the subgroup S of the group J and the genus s satisfy: 

(a) (S,s < | x]) is the intersection of Sand (J,s < |x}); 

(b) J(s)* = (WJ, s S |x|)/(J, 8 < |x|) ts a direct sum of rational groups of 
genus Ss; 
then (S, s S |x|) is the direct sum of (S, s < |x|) and a completely reducible 
group S(s) and r(S(s)) S r(J(s)*), i.e., S(s) is isomorphic with a direct summand 
of J(s)*. 

Proof.’ By condition (b) and Corollary 8.7a, b there exist rational groups 
R, of genus s such that 


(VJ,s< |r|) =(W,s<{a2]) + dX R,. 


It can be assumed that the indices v are the ordinal numbers, satisfying 
0O<v<o. Put J, = (J,s < |x|) + > R, for 0 <v S cand S, = inter- 


p<v 
section of (S,s < |x|) andJ,. Then J; = (J,s < |x|), Si = (S,s < | x}) 
by (a), Jou = Jp + Ry, Je = join of the groups J, with p < 2, if v is a limit 
ordinal, J, = (J,s S |x|), S, = (S,s S| 2)). 

Since S,4:/S8, is isomorphic with a subgroup of R,, it follows therefore by 
Corollary 8.2 that either S, = S,,, or that S,,,/S, is a rational group of genus s. 
Thus by Theorem 8.4 S,.; = S, + T,, where either T, = 0 or a rational group 
of genus s. 


16 A similar proof for the complete reducibility of the subgroups of direct sums of in- 
finite cyclic groups is due to L. Zippin. 
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The subgroup S’ of (S, s S | x |) which is generated by (S, s < | x |) and the 
groups 7’, is the direct sum of these groups. S, < 8S’. It can therefore be 
assumed that S, < S’ for every p < v. If v = v’ + 1 is not a limit number, 
then S, = S, + T, Ss 8S’. If v is a limit ordinal, then S, is the join of the 


groups S, with p < v and consequently S, < S’. Hence S, < S’, i.e., 


(S,s<j|r|)=(S,s<jr))+ D7. 


This completes the proof of the theorem. 

Corouuary 8.9. Let J be a direct sum of rational groups of genus s. Then 
the following three statements are equivalent. 

(a) The subgroup S of J is a direct sum of rational groups of genus s. 

(b) All the elements # 0 of the subgroup S of J have in S the genus s. 

(c) The subgroup S of J is isomorphic with a direct summand of J. 

Proof. (a) implies (b). If (b) is satisfied, the conditions of Theorem 8.8 
are satisfied, i.e., (b) implies (c). (¢) implies (a), since every direct summand 
of J satisfies the conditions of Theorem 8.8. 

Remark. This corollary implies in particular that every subgroup of a direct 
sum of infinite cyclic groups is a direct sum of infinite cyclic groups. 

Coro.iary 8.10. If the subgroup S of the group J and the genus s satisfy 

(a) J(s)* = (J, s S |2})/(J, 8s < |x|) ts a direct sum of rational groups of 
genus Ss; 

(b) (S,s < | x |) ts the intersection of S and (J,s < |x|); 

(c) the subgroup S(s)* of all those classes of J(s)* which contain elements of 
(S,s S |x|) ts a direct summand of J(s)*; 
there exist subgroups UL’, V such that 


VJ,s<i|r)=U+V4H,s<j|2r)), (Sss |r) =U+(S,8 <2) 


and the elements of U represent exactly the classes of S(s)*. 
Proof. Since conditions (a), (b) and (c) imply the conditions of Theorem 
8.8, it follows that 


(S,s<|r|/)=(S,s<jr|)+U 
where U is a direct sum of rational groups of genus s. Furthermore, U repre- 
sents exactly the classes of S(s)*. By condition (c), J(s)* = T* + S(s)*. By 
condition (a) and Corollary 8.7a, b it follows that 
(J,s = |r|) = V,s <|x|) + J(s) 


where J(s) is a direct sum of rational groups of genus s._ If V is the subgroup 
of J(s) which represents the classes of 7*, then 


(VJ,ss|[rl)=(VU,s< |r|) + U4 SV, 


and this completes the proof. 
Note that the above groups U’ and V are direct sums of rational groups of 
genus s. Furthermore, it is a consequence of Corollary 4.4 that condition (c) 
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is satisfied, if S(s)* is a closed subgroup of finite rank of J(s)* or if J(s)* is 
complete and S(s)* a closed subgroup. 


9. Partial reducibility. 
Notation. If J is any group, then 
A(./) is the set of all the genera s such that 


J(s)** = (J, |x| €s)/(WJ, |x| £8) ¥ 0; 
B(./) is the set of all the genera s such that 
J(s)* = (J,s = |2|)/(J,s < |x|) 4 0; 


and C(.J) ts the set of all the genera s such that elements of genus s exist in J. 

A(J) < B(J), since there exists by (2.4b) a homomorphism of J(s)* upon 
the whole group J(s)**, and it follows from (2.4b) that A(/J) = B(J), if (J, s < 
| x |) is always the intersection of (J, s S |x|) and (J, |x| £ s). 

B(J) < C(J), since elements of (J, s S |x|) which are not contained in 
(J, s < |2|) have in J the genus s. 

Cuatn ConpiTion. Jf s; S & S --- £8; S --- ts an ascending chain of 
genera S; in the set G of genera, almost all the s; are equal. 

If the set G of genera satisfies the chain condition, every subset of G satisfies 
the chain condition and every subset of G contains a “greatest’’ element g such 
that g <« s for every genus s in G and the chain condition is conversely a conse- 
quence of this condition. 

(9.1) If r(J) is finite, the chain condition is satisfied in C(.J) (and therefore in 
A(J) and in B(J)). 

For if |b| = s <t, then (J,t S |x|) < (J,s S | x|) and since (J, g < | x |) 
is a closed subgroup of J and r(J/) is finite, this implies r((J, t S | x |)) < r((J, 
s < |x |)), and thus the chain condition holds in C(,/), if r(J/) is finite.” 

It may happen that r(J) is finite and B(./) is infinite. This is shown by the 
following 

Example 9.2. Let pi, po, --- be an enumeration of the prime numbers and 
J a group of rank 2 which contains two independent elements b’ and b’’ such 
that the p-value of m(b’ + pib”) is 0, if p ¥ pi, and infinite, if p = p;. If 
pi is the genus of p;, then 0 = (J, pi < | x]), and (J, pi S | x]) consists of all 
the rational multiples of b’ + p,b’, i.e., B(J) contains every p; and is therefore 
infinite. 

THEOREM 9.3. If the chain condition is satisfied in B(J), then the following 
three propositions are equivalent. 

(a) J ts partially reducible. 

(b) J has the following properties: 





17 Another consequence of this argument is that also the ‘‘descending chain condition”’ 


holds in C(J). 
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x |); 


(b1) for every genus s, (J, 8 < |x|) is a direct summand of (J, s S | 
|x|) and 


(b2) for every genus s, (J, s < |x|) is the intersection of (J,s & | 
(J,|z| £8); 
(b3) corresponding to every element b ~ 0 in J there exists at least one genus 
s such that b = 0 mod (J, |x| ¢ s),b 4 Omod (J, |x| £s); 
(b4) to every element b of J there exists at most a finite number of genera s 
such that b = 0 mod (J, | «| < s),b 4 Omod (J, |x| £8). 
(c) If for every genus s the subgroup J(s) of J satisfies 


(es) (J,sS{x!l)=(J,s < |x|) + J(s), 


then J is the direct sum of these groups J(s).”" 

Proof. 1. If J is partially reducible, it follows from (2.6) and (2.7) that J 
fulfills the conditions (bl) to (b4). 

2. If J satisfies (b2) and J(s) is a solution of (es), then J(s) is by (2.4) also 
a solution of 


(és) (J,|x| £s) = J(s) + WV, |x| € 8). 


3. Suppose that J satisfies (b2) and that, for every genus s, J(s) is a solution 
of (es). If b) , --- , bd, are a finite number of elements # 0 which belong to 
different groups J(s), there exists one among them, say b; , such that |b; | <« 
|b, |. Then 


k 
b= >» b; = 0 mod (J, |x| £ |b |), b = b; 4 0 mod (J, |x| € |b: }), 


since, as proved in 2, J(| b; |) is a solution of (@| bi; |). Hence b ¥ 0 and this 
implies 

(9.3.3) If J satisfies (b2) and if, for every genus s, J(s) is a solution of (es), 
then the subgroup of J, generated by the groups J(s), is their direct sum. 

4. Suppose that J satisfies the conditions (b1) to (b4) and that, for every 
genus Ss, J(s) is a solution of (es). 

If b ¥ Ois an element of J, denote by A(b) the set of all the genera s such that 
b = 0 mod (J, |x| £ s),b A O mod (J, |x| £s). Denote by J’ the direct 
sum of the groups J(s) (which exists by (9.3.3) in J). 

Since every J(s) is also a solution of (Gs), there exists to every element b + 0 
and to every genus s in A(b) a uniquely determined element b(s) in J(s) such 
that b = b(s) mod (J,| x2 | €s). Since by (b4) the set A(6) is finite, it is possible 
to form the sum b’ of all the elements b(s) with s in A(b). 

Put b’” = b — b’ and let w be a genus such that s < w for every s in A(b). 
Since every b(s) # 0 and therefore every b(s) is an element of genus s, this 
implies that b = b” mod (J, |x| « w). If w # s for every s in A(b), then 
b(s) = 0 mod (J, |x| £ w), ie., b&” = b = 0 mod (J, |x| £ w) and w does 
not belong toA(b’”’). If wis an element of A(b), then b = b(w) mod (J, | «| € w) 


'8 And there exist solutions of the equations (es). 
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and therefore b’” = 0 mod (J, | z| £ w), i.e., w does not belong toA(b”). “Thus 
it has been proved that b = b” mod J’, and every element of A(b’’) is a true mul- 
tiple of at least one genus in A(b). 

5. Suppose finally that the chain condition is satisfied by B(/) and that J 
satisfies (bl) to (b4). There exist solutions /(s) of (es), their direct sum J’ 
is a subgroup of J and every element b of J is congruent mod J’ to an element 
b’ of J such that every genus in A(b’) is a true multiple of at least one genus in 
A(b). Since A(u) is vacuous if, and only if, u = 0, and since every A(u) is finite 
and a subset of B(J) = A(./), it follows from the chain condition that J = J’, 
i.e., (b) implies (c). 

6. Since the decomposition of J into the solutions J(s) of (es) is a partial 
reduction of J, (c) implies (a). 

That (a) is not a consequence of (b), if the chain condition in B(/) is not 
satisfied, is shown by the following 

Example 9.4. Let pi, po, --- be an enumeration of the prime numbers, 
R; a rational group of genus r; = | [] p; 

}i=1 
whose i-th coérdinate is an element of R;. 

The subgroup of all the vectors of J whose coérdinates except the i-th are 0 
is isomorphic with R; and may be identified with R;. 

Since ri, < r;, a vector v of J whose i-th codrdinate is the first coérdinate 
~ Ohas the genusr;in J. ThusC(/) consists exactly of the genera r; , 


, J the additive group of all the vectors 





VU,rsi|r)=V,|2| tr) =Ri+ Jr <|2|) = VU,ta <|2)), 


and therefore A(J) = B(J) = C(J). 

J satisfies all the conditions (b1) to (b4). But since J contains a continuum 
of elements and the direct sum of the groups R; is countable, J is neither com- 
pletely nor partially reducible.” 

That (c) is not a consequence of (a)—and even not of complete reducibility— 
if the chain condition in B(J) does not hold is shown by 

Example 9.5. Denote by s: < Ss: < --- < s; < --- some ascending chain 
of genera, by J; a rational group of genus s; and by J the direct sum of the 
groups J;. If b; + 0 is an element of J; , these elements b; form a basis of J. 
Put w; = b; + bis: and let @; be the closed subgroup of J, generated by wu; . 
The groups @; form a complete set of solutions + 0 of the equations (cs), if 
m(b;) | m(bi41), but their direct sum W does not contain b; and is < J. 

That (b3) is not a consequence of the chain condition in B(./) and the condi- 
tions (b1), (b2), (b4), is shown by 

Example 9.6. Let pi, pe,--- be an enumeration of the prime numbers, 
J’ the group generated by the elements (i , --- , %) fork = 1,2, ---,7; = 0,1, 
satisfying the relations (i, ---,%) = (,---,%,0) + (4,---,%, 1) for 


1 The separability of this group J will be proved in section 11. 
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0<*%*. J is the smallest group between J’ and some complete group such that 
i = wery 4 < bk 
the p-value of m((i, --- ,%) <J)is satiate dniaaiteadaie 
0, if p = poj.i, for some 7 S k. 
Every element (i, --- , 7%) is therefore in J the sum of two elements whose 
genera are true multiples of |(i,---, ¢)|. Consequently (J, s < |a]|) = 
(J,s S |x|) and (J, |a2| €s) = (J, |2| < 8) for every genus s, i.e., A(J) 
and B(./) are both vacuous. Thus J is a countable group, the chain condition 
satisfied in A(J) and in B(J/), conditions (b1), (b2) and (b4) are satisfied in J. 
But (b3) is not satisfied. 
Corouuary 9.7. If r(J) is finite, the following propositions are equivalent. 
(a) J is partially reducible. 
(b) For every genus s, (J, 8 < | x |) is a direct summand of (J, s S | x |) and 


r(J(s)*) = r(J(s)**/F(J(s)™*)). 


(c) For every genus s, (J,s < | x |) is a direct summand of (J,s S | x |) and is 
the intersection of (J,s < |x|) and (J, |x| <s). 
(d) If for every genus s the subgroup J(s) of J satisfies 


(ds) (J,s < |x|) = J(s) +-G,s < |2x)), 


then J is the direct sum of these groups J(s).” 

Proof. 1. If J is partially reducible, then J(s)* and J(s)** are by (2.6) 
isomorphic groups and thus (b) is a consequence of (a) and (2.6). 

2. Suppose that r(J) is finite and that J satisfies (b). There exists by (2.4) 
a homomorphism of J(s)* upon J(s)** and therefore there exists a homo- 
morphism a of J(s)* upon J(s)**/F(J(s)**). If W* is the subgroup of J(s)* 
which is mapped upon 0 by a, then J(s)*/W* and J(s)**/F(J(s)**) are iso- 
morphic. Since therefore W* is a closed subgroup of J (s)* and 


r(J(s)*) = r(W*) + r(J(s)**/F(J(s)**)) = r(W*) + r(J(s)*), 


and since all the ranks are finite, it follows that W* = 0. The homomorphism 
of J(s)* upon J(s)** is therefore an isomorphism and it follows from (2.4) that 
J satisfies (c). 

3. Suppose that r(J) is finite and that J satisfies (c). Then every subgroup 
(J, Ss S | x |) has finite rank and satisfies (c). 

If r(J) = 1, then (ec) and (d) are satisfied. It can therefore be assumed 
that the group J’ satisfies (d), if r(J’) < r(J/) and if J’ satisfies (ce). 

Suppose now that the subgroups J(s) of J are solutions of (ds) and that J’ 
is the subgroup of J generated by the subgroups J(s). Then it follows from 
(c) and (9.3.3) that J’ is the direct sum of the groups J(s). 

Let now b # 0 be an element of J and g its genusin J. Then (J,t S |x|) < 
J and therefore r((J, t S | 2|)) < r(J) for every genus g < t. Consequently 


2 And there exist solutions of the equations (ds), 
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every (J, t < |x|) with g < t is the direct sum of the groups J(s) witht S s. 
Since (J, g < | |) is the join of the groups (J, t S |x|) withg <t, (J, g < 
| z |) is the direct sum of the groups J(s) with g < s and (J, g S | x |) is there- 
fore the direct sum of the groups J(s) with g S s, i.e., b is contained in J’, i.e., 
J = J’ is the direct sum of the groups J(s). 
4. (d) implies (a), since the decomposition of J into the solutions of (ds) is a 
partial reduction of ./. 
Coro.uary 9.8. Suppose that B(J) satisfies 
(0) If sand t are two genera in B(J), then eithers <tors =tort<s. 
(I) If the chain condition is satisfied in B(.J), the propositions (a) to (c) of 
Theorem 9.3 are true in J if, and only if, 
(I’) for every genus s, (J, s < |x|) is a direct summand of (J, s S |x}), 
(I’’) elements b # 0 are not contained in (J, |b| < |x }). 
(II) If r(J) ts finite, the propositions (a) to (d) of Corollary 9.7 are true in J if, 
and only if, for every genus s, (J, 8 < |x|) isa direct summand of (J, s S| 2)}). 


Proof. If J is partially reducible and satisfies (0), b = >> b; is an element 
i=1 


~ 0 of J and the elements b; ¥ 0 belong to different components of a smallest 
partial reduction of J, then |b | < --- < |bk| and |b| = |b: |, ie., (I’) and 
(I’’) are satisfied by J. 

If conversely (0), (I’) and (I”) are satisfied in J, then A(J) = B(J) = C(J), 
(J,s = |z|) = WJ, |z| £8), J, s < |x|) = (J, |x| £ 8) and conditions 
(b1) to (b4) of Theorem 9.3 are therefore satisfied by J. 

If r(J) is finite and there exists for every genus s a solution J(s) of (ds) in 
Corollary 9.7, the subgroup J’ of J, generated by these groups J(s), is by (0) 
their direct sum. If b ¥ 0 is an element of genus g, then (J, g < |x|) is the 
join of the groups (J, t S |x|) withg <t. Thenr((J,t S |x|)) < r(J) and 
it can therefore be assumed that (J, t S | 2 |) is the direct sum of the groups 
J(s) with g <t S s,ie., (J, g < |x|) and consequently (J, g S | x |) are sub- 
groups of J’, i.e., b is an element of J’, i.e., J = J’ is the direct sum of the 
groups J(s). 

Note that the examples 9.4 to 9.6 satisfy (0). 

Coro.iary 9.9. Suppose that the chain condition is satisfied in B(J). Then 
J is partially reducible if, and only if, 

(a) for every genus s, (J,s < |x|) is adirect summand of (J,s S 

(b) for every genus s, (J, s < |x|) is the join of the groups (J, t 
s < tand tin Bi); 

(c) for every genus s, (J, 8 < |x|) is the intersection of( J, s < |x|) and (J, 
|x| £8). 

Proof. The necessity of the conditions (a) and (b) is a consequence of (2.6) 
and the necessity of (¢) a consequence of (2.7). 

If conversely (a) to (c) and the chain condition in B(/) are satisfied, then 
let, for every genus s, J(s) be a subgroup of J such that (J,s S |z\|) = J(s) + 
(J, s < |2|). The joingroup J’ of these groups is by (c) and (9.3.3) their 
direct sum. 


| x |); 
< |x|) with 
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Let W be the set of all those genera s in B(/) such that (J, s S 2|) € J’. 
If W is not vacuous, there exists from the chain condition a “greatest’’ genus 
winW. (J,w < |2)) is by (b) the join of the groups (J, t S |x|) withw <t 
and t in B(J) and since these genera t are not contained in W, it follows that 
(J, w < |a2|) S J’ and consequently that (J, w < |z!|) S J’. Thus Wis 
vacuous, i.e., (J, 8 S |x|) S J’, ifsin B(J). If zis the genus of the number 1, 
then J = (J,z S |x|) = J(z) + (J,z < | x |) and by (b) it follows that (J, z < 
2|) S J’, ie., that J = J’ is the direct sum of the groups J(s). 

THEOREM 9.10. If the subgroup S of the partially reducible group J satisfies 

(a) the chain condition in B(S); 

(b) for every genus s, (S, s < | x!) is a direct summand of (S,s S |2}); 

(c) (S, f(x)) is the intersection of S and (J, f(x)) for every f(x) = (s S |x )), 
(s <|2)), (|x| £8), (Jz| €8); 
then S is partially reducible and every S(s)* is isomorphic with a subgroup of J(s)*. 

Proof. J satisfies the conditions (a)—(c) of Corollary 9.9 since J is partially 
reducible. Thus conditions (a)—(c) of Theorem 9.10 imply that S satisfies the 
conditions of Corollary 9.9 and S is therefore partially reducible. 

Coro.uary 9.11. Suppose that S is a subgroup of the partially reducible 
group J and that the chain condition is satisfied in B(J). Then S is a direct 
summand of J if, and only 7f, 

(a) for every genus s, (S,s < | x |) is the intersection of S and (J,s < |2x}); 

(b) for every genus s, the elements of (S,s S | x |) represent a direct summand 
of J(s)*; 

(ce) S is partially reducible. 

Proof. Suppose first that S is a direct summand of J. Then (a) and (b) 
are satisfied, since (S, f(x)) is a direct summand of (J, f(x)) (for the discussed 
functions f(x)), and since therefore the conditions (a)—(c) of Theorem 9.10 
are satisfied in S and J, S is also partially reducible. 

Suppose now that the subgroup S of J satisfies the conditions (a)—(c). Then 
there exists a smallest partial reduction of S, S = ZZ S(s), and the subgroup 


s 
S(s) of S represents exactly the subgroup S(s)* of J(s)* which is represented 
by elements of (S,s S |x|). By (b), J(s)* = S(s)* + T(s)*. There exists 
furthermore a partial reduction of J and if J(s) satisfies (J,s S |2.) = J(s) + 
(J,s < |x|), denote by 7(s) the subgroup of J(s) which represents the classes 
of T(s)*. Then 


(J,s <\2!) = S(s) + T(s) + (J, 8s < | 2}), 
and therefore by Theorem 9.3 


J = D (S(s) + T(s)) = 2 S(s) + D Ts) = 8’ +7, 


and since the chain condition is also satisfied in B(S) < B(/), it follows from 
Theorem 9.3 that S = S’ is a direct summand of J. 
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Note that it is possible to substitute for condition (c¢) of the Corollary 9.11 
the condition (c) of Theorem 9.10. 


10. Complete reducibility. It is the object of this section to combine the 
results of the two preceding sections. 

THEOREM 10.1. The group J is completely reducible if, and only if, 

(a) for every genus s, J(s)* is a direct sum of rational groups of genus s; 

(b) for every genus s, (J, 8S < |x|) ts the intersection of (J, s S |x|) and 
(J,|2| £8); 

(ce) the set A(b) of all the genera s such that 


b = 0 mod (J, |x| £8), b # Omod (J, | 2); € 8) 


as for every clement b ¥ 0 of J finite and not vacuous; 

(d) J is the direct sum of groups J, such that the chain condition is satisfied in 
every B(J,). 

This is a consequence of Theorem 9.3 and Corollary 8.7, since the conditions 
(a)—(c) are satisfied in a direct sum if, and only if, they are satisfied in all the 
direct summands. 

Corouiary 10.2. The group J of finite rank is completely reducible if, and 
only if, for every genus $ 

(a) J(s)* is a direct sum of rational groups of genus s; 

(b) r(J(s)*) = r(J(s)**/F(J(s)**)). 

This is a consequence of Corollaries 9.7 and 8.7. 

By Corollary 9.7 it is possible to substitute for (b) the condition 

(b’) (J, s < | x |) ts the intersection of (J, s S |x|) and (J,|r| £8). 

TuHeoreM 10.3. If the subgroup S of the completely reducible group J satisfies 

(a) the chain condition holds in B(S); 

(b) (S, f(x)) is the intersection of S and (J, f(x)) for every f(x) = (s S |x}), 
(s <|z|),({z| € 8), (|r| £8); 
then S is completely reducible and isomorphic with a direct summand of J. 

This is a consequence of Theorems 9.10 and 8.8. 

An obvious consequence of Theorem 10.3 is the 

Coro.uary 10.4. Every direct summand D of the completely reducible group J 
such that the chain condition holds in B(D) is completely reducible. 

Coro.tiary 10.5. Suppose that S is a subgroup of the completely reducible 
group J and that the chain condition holds in B(.J). Then S is a direct summand 
of J if, and only if, 

(a) for every genus s, (S,s < | x |) is the intersection of S and (J,s < | x}\); 

(b) for every genus s, (S, |x| <= 8) ts the intersection of S and (J,|x| £); 

(ce) for every genus s the elements of (S,s S |x|) represent a direct summand 
of J(s)*. 

This is a consequence of Corollary 8.10, Theorem 9.3 and Corollary 9.9 (since 
every condition used is satisfied in /). 
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Coro.tiary 10.6. Let J be a completely reducible group. Then every subgroup 
S of J, satisfying for every genus s the three conditions 

(a) (S,s < | x |) és the intersection of S and (J,s < |x|); 

(b) (S,|a2| £8) ds the intersection of S and (J,|2\ £8); 

(c) the elements of (S,s S | x |) represent a closed subgroup of J(s)*; 
is a direct summand of J if, and only if, 

(1) the chain condition is satisfied in B(./); 

(2) for every genus s, J(s)* is complete or r(.J(s)*) is finite. 

Proof. The necessity of (1) may be derived from Example 9.5 and the 
necessity of (2) is a consequence of Corollary 3.6. If (1) and (2) are satisfied, 
every subgroup S, satisfying (a)-(c), by Corollary 3.6 satisfies the conditions of 
Corollary 10.5 and is therefore a direct summand of J. 


11. Separability. 

Lemma 11.1. The group J is separable if, and only if, 

(a) every finite subset of J is contained in a partially reducible direct summand 
of J; 

(b) J(s)* = (J, s S {x \)/(J, 8 < | x|) is separable for every genus s. 

Proof. Suppose first that / is separable. Then (a) is satisfied, since every 
complete reduction of a group is also a partial reduction. If F* is a finite sub- 
set of J(s)*, F a subset of J representing /*, then F is contained in a completely 
reducible direct summand D of J. In particular, (D, s S ||) is the direct 
sum of (D, s < |x|) and rational groups of genus s, (D, s S | 2 |) is a direct 
summand of (7, s S |x|) and (D,s < | x |) a direct summand of (J, s < | 2 |). 
The elements of (D, s S |x) represent therefore a direct summand of J(s)* 
which contains F* and is completely reducible, i.e., J(s)* is separable. 

Assume now that the conditions (a) and (b) are satisfied by J. Then it 
follows from (2.7) that every element ~ 0 of J(s)* has the genus s in J(s)* 
and consequently every closed subgroup of finite rank of J(s)* is by Corollary 
4.4 a direct summand of J(s)* and by Corollary 4.3 a direct sum of a finite 
number of rational groups of genus s. If now F is a finite subset of J, then 
F is contained in a partially reducible direct summand D of J. If D = >> D(s) 

s 


is a smallest partial reduction of D, the components of elements of F in D(s) 
form a finite subset F(s) of D(s). Since D(s) represents exactly a direct sum- 
mand of J(s)*, it follows from the above that every closed subgroup of finite 
rank of D(s) is a completely reducible direct summand of D(s). The closed 
subgroup F(s) of D(s), generated by F(s), is therefore a completely reducible 
direct summand of D(s) and F is contained in a completely reducible direct 
summand of J, i.e., J is separable. 

A consequence of this proof is the 

Corouuary 11.2. The group J is separable if, and only if, 

(a) every finite subset of J is contained in a partially reducible direct summand 


of J; 
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(b’) every closed subgroup of finite rank of J(s)* is a direct summand of J(s)* 
(and a direct sum of rational groups of genus s). 
THEOREM 11.3. Suppose that the chain condition is satisfied in B(J). Then 
the group J is separable if, and only if, 
(a) for every genus s, (J, s < |x|) is the intersection of (J, s S |x|) and 
(J,|2| £8); 
(b) for every genus s, J(s)* is a separable group and all the elements # 0 of 
J (s)* have the genus s in J(s)*; 
(c) for every genus s, every subgroup S satisfying 
(cl) every element ~ 0 of S has the genus s in J, 
(c2) r(S) is finite, 
(c3) S is a direct summand of (J, |x| <s), 
is a direct summand of J; 
(d) the set A(b) of the genera s such that 


b = 0 mod (J, |x| <s), b # 0 mod (J, | 2] € s) 


is for every element b ¥ 0 in J finite and not vacuous. 

Proof. 1. If J is separable, then J satisfies (a), (b) and (d) by Lemma 11.1 
and (2.7). If furthermore the subgroup S of J satisfies (cl)—(c3), then S is by 
Theorem 4.2 a direct summand of J. 

2. If the chain condition is satisfied in B(J/) and J fulfills (a)—(d), then the 
same holds true for every direct summand of J. Thus it is sufficient to prove: 
if J satisfies (a)—(d) and if the chain condition holds in B(./), every element of J 
is contained in a completely reducible direct summand. 

3. Suppose that J satisfies the conditions (a)—(c) and that F is a primitive 
set in J (see Definition 5.1). 

Assume first that F consists of exactly one element b and that s is the genus 
of b. Denote by 6 the closed subgroup of J, generated by b, and by b* the 
subgroup of J(s)*, represented by elements of 6. Since 6 is a primitive element 
of its genus s, b* is a closed subgroup of rank one of J(s)* and by (b) and Corol- 
lary 4.4 therefore a direct summand of J(s)*, i.e., J(s)* = 6* + 7T*. If T 
is the subgroup of (J, s < | x |) which contains (J, s < | x |) and satisfies 7* = 
T/(J,s < |x|), then (J,s < |x|) = 6 + 7, since the elements of 6 represent 
exactly the classes of 6*. If 7’ is the subgroup of (J, | z| < s), generated by 
T and (J, | 2| £ s), it follows from (a) and (2.4) that (J, |2| << s) = 6+ 7", 
and (c) implies therefore that 6 is a direct summand of J. 

Suppose now that F contains k elements bh, --- ,b:, 1 < k. Then there 
exists an element b;, say b,, such that |b; | <« |b. |. As proved above, J = 
b. + J’. J’ contains the elements b; , --- , bk-1, since (J,s S$ |x|) = (J’,s S 
|2|) fors £ |b |. Since J’ satisfies (a)—(c) as a direct summand of J, and 
since the elements b; , --- , b:. form a primitive set of k — 1 < k elements in 
J’, it can be assumed by complete induction that the elements bh, , --- , bk 
form a basis of a direct summand of J’ and consequently it has been proved 
that F is a basis of a direct summand of J. 
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4. Suppose that J satisfies the conditions (b) and (d) and that b # 0 is an 
element of J. Then there exists corresponding to every genus s in A(b) by (2.4) 
an element b’(s) such that 


b = b’(s) mod (J, |2| £8), b’(s) = Omod (J,s S |x), 


and there exists by Corollary 8.2 a primitive element b(s) of genus s in J such 
that b’(s) = b(s) mod (J, s < ||). The elements b(s) with s in A(b) form a 
primitive set in J. If b’ is the sum of this primitive set, then every genus in 
A(b — b’) is a true multiple of at least one genus in A(b), since for a genus s in 
A(b),b — b’ = 0 mod (J, |x| € s) holds and for a genus t such that s < t for 
every s in A(b), b = 0 mod (J, |x| < t) holds, and therefore b — b’ = 0 mod 
(J, |2| € t). 

Since A(b) < A(J) S B(J), it follows from this result that if the conditions 
(b) and (d) are satisfied in J and if the chain condition holds in B(J/), every 
element + 0 of J is the sum of a primitive set in J. 

5. If J satisfies (a)—(d) and if the chain condition holds in B(/), every element 
of J is contained in a completely reducible direct summand of J, as has been 
proved in 3 and 4. Thus it follows from a remark made in 2 that J is separable. 

Corouiary 11.4. If S is a direct summand of the separable group J and if the 
chain condition holds in B(S), then S is separable. 

For J satisfies as a separable group the conditions (a)—(d) of Theorem 11.3 
and consequently the direct summand S satisfies these conditions too and is 
therefore by Theorem 11.3 separable. 

THeEorREM 11.5. Jf J is a separable group, if every group J(s)* belongs to a class 
I, , and if the chain condition holds in B(.J), then J is completely reducible. 

Proof. Every group J(s)* is by Corollary 8.7, Lemma 11.1 and (2.7) a 
direct sum of rational groups of genus s. J is therefore by (2.7) and Theorem 
10.1 completely reducible. 

Remark. Since it can be proved that the group J of Example 9.4 is separable, 
the chain condition cannot be omitted in Theorem 11.5. 

A proof of the separability of the mentioned group J runs as follows. 

1. If v + 0 is an element of J, put z(v) = minimum of the m(v; < R,); with 
v; ~ 0. 

2. If v + 0, 2(d) = 1, let k be the index such that z(v) = m(y. < Ri);y. Then 
v =v + v” with vo; = v;, v; = Ofori < k, and v; = 0, v; =»; fork < i. 
If v’” is the closed subgroup of J generated by v’’, then 


k-1 
J= DR +074+ (Un < |x|), 
i=1 


and v is therefore contained in a direct summand of finite rank. 
3. If 1 < 2(v), thenv = vo’ + 0” + 0” with 


v; = 0, v; = 0, v; =0 (i <k), 
v; = 0, vi = 2(v)ziw, , vi = zw = (k S i), 
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where w; = m(v; < R,)7'v;, if v5 ¥ 0, w; = 0, if v0; = 0, and 
mv; < Ri); = 2(v)zi + 2, O<2<2v), ifk sin +0. 
Then 2(v’”’) < 2(v), for v’”’ an element of (J, r% < |x|) and 


&—1 
J= DR 407+ ,m < |2}). 
i=1 


Since the same argument as on J can be applied on (J, rm < |x|), by complete 
induction there exists a completely reducible subgroup V of finite rank and an 
index 7 such that v is contained in V and J = V + (J, ri < | x}). 

4. The separability of J follows now by complete induction with regard 
to the number of elements contained in the given finite subset. Note further- 
more that there exists a continuum of different genera and that therefore the 
Theorem 11.5 is not a consequence of Theorem 4.7, even if the groups J(s)* 
are all countable. 

THEOREM 11.6. The class X of groups is the class of all the separable groups tf, 
and only if, X is the greatest class of groups such that 

(a) the elements + 0 of a group J in X are sums of primitive sets in J; 

(by every primitive set in a group J in X is a basis of a direct summand of J; 

(c) if J belongs to X and S is a direct summand of finite rank of J, then J/S 
belongs to X. 

Proof. The class of all the separable groups satisfies (a)—(c) by Lemma 4.6, 
Lemma 5.2 and Lemma 5.8. If on the other hand the class X satisfies (a)—(c), 
all the groups in J are separable, i.e., the class of all the separable groups is the 
greatest class satisfying (a)—(c). 

If n is any finite or infinite cardinal number, C a complete group of rank n, 
then a group J is isomorphic with a subgroup of C if, and only if, r(J) S n. 
Therefore it is possible to transform Theorem 11.6 into the following criterion 
which does not use concepts conflicting with the antinomies of the theory of sets. 

Coroiuary 11.7. If nis any finite or infinite cardinal number, C a complete 
group of rank n and X the greatest class of subgroups of C which satisfies 

(a) the elements # 0 of a group J in X are sums of primitive sets in J; 

(b) every primitive set in a group J in X is a basis of a direct summand of J; 

(c) if J belongs to X and S is a direct summand of finite rank of J, every sub- 
group of C (or J) which is isomorphic with J/S belongs to X; 

a group J is isomorphic with a group in X if, and only if, J is separable and 
rJ) Sn. 


12. Vector-groups. 

Derinitions 12.1. If @ = (Gi, ---,G.,---) is @ set of (equal or different) 
groups, a vector (in ) is a single-valued function (of the indices v) such that the 
v-th coordinate f, of the vector f is an element of the group G, . 

The sum f + g of the two vectors f and g is defined by (f + g)v = fu + Ge - 

Thus the set of all the vectors in @ becomes the additive group V = V(@) of all 
the vectors in . 
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The group of all the (countable) sequences of integers is an example of a 
vector-group. Another example is the group discussed as Example 9.4. 

If V is a vector-group, then 
V’ is the subgroup of all those vectors f such that almost every coérdinate of 

f is 0; 

V.. is the subgroup of all those vectors f such that every coérdinate but the v-th 

coordinate is 0; 

V.” is the subgroup of all those vectors f such that the v-th codrdinate is 0, 
and V = V. + V. for every v, V’ = D> Ve. 

The following proposition is easily verified: 

(12.2) The group A is isomorphic with a subgroup B of the vector-group V(), 
satisfying V’ < B < V éf, and only if, there exist pairs of subgroups A,, Al of A 
such that A‘, and G, are isomorphic; A = A, + A. for every v; A, < A‘. forv ¥ u; 
the intersection of the groups A’ is 0." 

Finally V’ = V(@) if, and only if, ¢ is finite.” 

If n is a finite or infinite cardinal number, G any group, ¢ a set consisting of n 
groups which are isomorphic with G, the notation V(G, n) = V(@) will be used. 
(12.3) Suppose that R is a rational group of genus s. : 

(a) sis an invariant of V(R, n). 

(b) n is for every rational group R an invariant of V(R, n) if, and only if, n is 
the only solution of the equation 2° = 2".” 

Proof. Let s° be the genus of the infinite part of the numbers of genus s. 
Then there exist elements of genus t in V(R, n) if, and only if, s’ < t < sfor 
infinite n;s = tfor finite n. This proves (a). 

If n is finite, then r(V(R, n)) = nis an invariant of V(R, n). 

If n is infinite, then r(V(R, n)) = 2" is an invariant of V(R, n) and n is conse- 


2 This “approximate decomposition”’ of the group A into the groups A¢ is exactly the 
dual of L. Pontrjagin’s concept of direct decomposition of the character group of A. But 
not every approximate decomposition can be realized by a direct decomposition of A. See 
L. Pontrjagin, The theory of topological commutative groups, Ann. of Math., vol. 35 (1934), 
pp. 361-388. 

22 Though V(¢) is uniquely determined by the set ¢, ¢ is not always an invariant of the 
group V(¢). If e.g., @ consists of 2 groups, isomorphic with the additive group of all the 
real numbers, and y consists of 3 groups, isomorphic with the additive group of all the 
complex numbers, then V(¢) and V(y¥) are both complete groups of rank 2®° and therefore 
isomorphic. 

A group which is given as a vector-group is therefore only given by a ‘‘representation’”’ 
of the group and not by invariant properties. This applies also to the topology in the 
group which is suggested by this representation, since such a topology is generally not a 
natural topology even in the weak sense that all the proper automorphisms are continuous. 
Natural topologies in the stronger sense that every proper or improper automorphism is 
continuous are rather rare. For if in the abelian group A there exists a non-discrete, 
natural topology in the stronger sense, then A is direct irreducible if A is connected. 

23m is the only solution of the equation 2" = 27, if the Cantor continuum hypothesis 
2: = N.,, is satisfied. Without the hypothesis this proposition has not yet been proved. 
See W. Sierpinski, L’ Hypothése du Continu. 
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quently an invariant of V(R, n), if n is the only solution of 2” = 2". If wis 
another solution of this equation, then n and w are infinite. Let W be the 
additive group of all the rational numbers. Then V(W, n) and V(W, w) are 
both complete groups of the same rank and therefore isomorphic, i.e., n is not 
an invariant.” 

THEOREM 12.4. Suppose that R is a rational group of genus s. Then V(R, n) 
is completely reducible if, and only if, n is finite or R complete. 

Proof. If n is finite, then V(R, n) is a direct sum of n groups, isomorphic 
with R, i.e., completely reducible. 

If n is infinite and R complete, then V(R, n) is complete and therefore a 
direct sum of 2” groups, isomorphic with R, i.e., completely reducible. 

Suppose now that n is infinite and R a true subgroup of the additive group 
of all the rational numbers (R # 0). If, as may be assumed, the indices of the 
coérdinates are ordinal numbers, denote by W the subgroup of those vectors 
whose coérdinates with infinite index are 0. Then W is a direct summand of 
V(R, n) and is a V(R, &o). 

Since FR is not complete, there exists a prime number p such that pR < R. 
Denote by W, the subgroup of all those vectors in (W,s S |x|) such that 
for every integer 7 (= 0) almost every codrdinate is contained in p'R. Then 
W, is a closed subgroup of (W,s S | x}). 

If W’ is the subgroup of all those vectors in W such that almost every co- 
ordinate is 0, then W’ is a direct sum of No groups of genus s and thus W’ is 
countable. W’ < W,. 

If b is any element in W,, there exists an element b’ in W’ such that b = b’ 
mod pW,. W,/pW, is therefore countable. Since all the elements of W, 
have in W, genus s and since the p-values of their multiplicities are finite, and 
since finally W, contains a continuum of elements, this implies that W, is not 
completely reducible. 

Since (W,s < |x|) = Oand all the elements # 0 of (W,s S | x |) have genus 
s in W, and since all the elements # 0 of W, have genus s in W,, it follows 
from Corollary 8.9 that (W,s < | x |) is not completely reducible. 

Since all the elements of (V(R, n), s S ||) have genus s in V(R, n) and 
since (W,s S | x |) isa direct summand of (V,s S | x }), it follows that (V,s S 
|x|) and consequently that V(R, n) is not completely reducible. Thus the 
theorem and the following corollary have been proved. 

Corouiary 12.5. If n is infinite and the rational group R not complete, then 
(V(R, n), | R| S |x|) is not completely reducible. 

THEOREM 12.6. Jf R is a rational group, then (V(R, n), |R| S |2}) is 
separable. 

Proof. Denote by s the genus of R and by g a number of genus s. If f 
is an element of V(R, n) and its v-th coérdinate # 0, then m(f, < R) = 


24 Whether the invariants | R | and 2" characterize V(R, n) is still unknown. 
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h(f, v)k(f, v)'g, where h and k are relatively prime integers, both relatively 
prime to the infinite part of g. If f, = 0, put A(f, v) = 0, k(f, v) = 1. Since, 
if f + 0, m(f < V) is the g.c.d. of the numbers m(f, < 2), f belongs to (V,s S 
| x |) if, and only if, the l.e.m. k(f) of all the numbers k(f, v) is finite. 

If f # 0 is an element of (V, s S | 2]}), i-e., if k(f) is an ordinary positive 
integer, put h(f, v) = A(f, v)k(f)k(f, v)*. AC, v) is either 0 or an ordinary 
positive integer and satisfies h(f, v)k(f)* = ACf, v)k(f, v)™. 

If f ¥ 0 is an element of (V,s < |x|), denote by A(f) the minimum of the 
numbers h(f, v) # 0. 

1. If A(f) = 1, let f be the closed subgroup of V generated by f. Since m(f < 
J) = k(f)"g = m(f. < R) for a certain u, it follows that V = f + Vi. 

2. If 1 < A(f), then f = f’ + f”, where 


fo = Z(v)ACHACS, v) “fe , fi = 2" (W)R(f, ») fe, (f. # 0), 
and f. = f, = 0, if f, = 0, 
A(f, v) = h(f)z’(v) + 2’(v), 0S 2"(v) < Af). 


That f. and f,’ exist in R is a consequence of the definition of k(f). 

Then 1 = h(f’) and therefore, as proved in 1, V = f’ + V‘,, where wu is a 
suitable index and f’ the closed subgroup of V generated by f’. Since f” is an 
element of V“), since h(f”’) < h(f) and since Vi is isomorphic with V(R, n — 1), 
it follows by complete induction that there exist two subgroups A and B of V 
such that V = A + B,f = 0 mod A, B consisting of all those vectors b in V 
such that b,, = --- = b,, = 0 for a given finite set of indices »; , and A being 
a direct sum of q rational groups of genus s. 

Since B is isomorphie with V(R, n — q), it follows by complete induction that 
every finite subset of (V, s < |x|) is contained in a direct summand D of V 
which is a direct sum of a finite number of rational groups of genus s. 

Since this direct summand D is a subgroup of (V, s < | 2 |), this implies in 
particular that (V,s S | x |) is separable. 

By Corollary 4.4 the above result implies the 

Coro.iary 12.7. Every closed subgroup of finite rank of (V(R, n), |R| S 
|x|) 7s a direct summand of V(R, n). 

Thus the groups (V(R, n), | R| < |x|) with infinite » and incomplete R 
are examples of separable groups which are not completely reducible, though 
all their elements ~ 0 have the same genus, and which therefore by Corollary 8.5 
do not belong to a class T,. 

Coro.uary 12.8. Every closed subgroup of (V(R, n), | R| < | x |) ts separable 
and every closed subgroup of (V(R, n), | R| S |x|) which belongs to a class T, 
is completely reducible. 

This is a consequence of Corollaries 12.7, 4.4 and 8.7. 

If J is a group and s a génus such that for every closed subgroup S of finite 
rank all the non-zero elements of J/S have the genus s, it is undecided whether 
or not J is separable. 
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SOLUTION OF A PROBLEM OF F. RIESZ ON THE HARMONIC 
MAJORANTS OF SUBHARMONIC FUNCTIONS 


By Trspor Rapé 


Introduction. Let u(x, y) be a subharmonic function’ in a domain G. Con- 
sider a domain G’ comprised in G together with its boundary B’. If H(z, y) 
is continuous in G’ + B’ and harmonic in G’, and if H = uon B’, then H = u 
in G’ also, by the definition of a subharmonic function. If u is continuous, 
and if the Dirichlet problem is solvable for the region G’ + B’, then the har- 
monic function h determined by the condition h = u on B’ is clearly the one 
which yields the best possible limitation for u on the basis of the fundamental 
property of subharmonic functions quoted above. 

If however wu is a general (and therefore possibly discontinuous) subharmonic 
function, then the situation is less clear. It can be shown (R 2, p. 358) that 
there exists in G’ a least harmonic majorant h* characterized by the following 
properties. (a) h* = win G’, (b) if H is harmonic in G’ and H 2 u in G’, then 
H = h* in G’. But this least harmonic majorant did not seem to be the best 
one, as far as usefulness was concerned. At any rate, F. Riesz (R 1, p. 334) 
reserved the name of best harmonic majorant for a harmonic majorant defined 
in a different fashion, namely, in terms of the values of u on the boundary 
B’ of G’ (see 1.2), while the least harmonic majorant h* is defined in terms of 
the values of u in G’ alone. The best harmonic majorant, in the sense of F. 
Riesz, will be denoted by h and will be referred to by the letters B. H. M. 
The letters L. H. M. and the notation h* will refer to the least harmonic majorant 
described above. 

F. Riesz stated (R11, footnote on p. 334) that he established the identity 
of h and h* in various special cases. Brelot® gave an explicit proof in the 
case when the subdomain G’ is bounded by circles. It is the purpose of this 
paper to prove the identity of h and h* without any restrictions on G’, except for 
the assumption, implied in the very definition of h, that the Dirichlet problem is 
solvable for the region G’ + B’” 

The proof of this result could be based on the general theorems of F. Riesz 


Received October 8, 1936; presented to the American Mathematical Society, December 
1936. 

1 See F. Riesz, Sur les fonctions subharmoniques et leur rapport a la théorie du potentiel, 
parts I and II, Acta Mathematica, vol. 48 (1926), pp. 330-343 and vol. 54 (1930), pp. 322-360. 
These papers will be referred to as R 1 and R 2. 

2M. Brelot, Etude des fonctions sousharmoniques au voisinage d’un point, Actualités 
scientifiques et industrielles, vol. 139 (1934), p. 18. 

3 In §5 of this paper, we shall give an interpretation of this result which seems to express 
more adequately its true meaning. 
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on the representation of subharmonic functions in terms of negative mass 
distributions. It seemed desirable, however, to present a proof based on much 
more elementary considerations. The idea of the proof is to push the use of 
the method of approximation by integral means‘ a little farther than usual 
and thus to avoid the use of improper integrals which seem to obscure some 
of the facts which are essential for our purposes. Specifically, our main tool 
is a trivial modification (formula 2 in 2.2) of the classical formula of Green. 
Roughly speaking, we apply the formula not to the function itself which is to 
be studied, but to its integral mean. It seems to the author that this modified 
formula might prove useful in various other problems, also, whenever it is de- 
sirable to avoid improper integrals. 

It should be noted that the least harmonic majorant, as defined above, ad- 
mits of an important interpretation in potential theory. The potential of a 
negative mass-distribution is a subharmonic function u. The sweeping-out 
process, applied in a domain G’, leads to a new potential u* (see G. C. Evans, 
Potentials of positive mass, part II, Transactions Amer. Math. Soc., vol. 38 
(1935), pp. 201-236), and it follows immediately that u* = h* in G’, where h* 
is the least harmonic majorant of uin G’. This remark suggests various prob- 
lems, similar to the one solved in this paper, which the author plans to discuss 
elsewhere. 


1. Preliminaries. 1.1. Let G’ be a bounded domain (connected open set). 
Denote by B’ the boundary of G’. We shall say that G’ + B’ is a Dirichlet 
region if for every continuous function ¢g given on B’ there exists in G’ + B’ 
a continuous function A which is harmonic in G’ and which reduces to ¢ on B’. 

1.2. Denote by u a function which is subharmonic in a domain G. Con- 
sider a Dirichlet region G’ + B’ interior to G. As u is upper semi-continuous 
(R 1, p. 333), we have on B’ a sequence of continuous functions g, ,k = 1,2, --- , 
such that g “ uon B’ (the symbol g, \ u indicates that ¢g; is a decreasing 
sequence). Denote by H, the solution of the Dirichlet problem, for the region 
G’ + B’, with g as the prescribed boundary function. Then (R1, p. 333) 
H, \ hin G’, where h is harmonic and = uin G’. This harmonic function h 
is the best harmonic majorant (B. H. M.) in the sense of F. Riesz’ of u in G’. We 
shall list presently some of its properties. 

1.3. The B. H. M. depends only upon the values of u on B’. If we use a 
different sequence g, “. u on B’, we obtain the same h. Also, if wu, we are 
subharmonic in G and if uw = uw on B’, then uw, and us have the same B. H. M. 
in G’ (R 1, p. 334). 


4 See R 2, pp. 343 and 345 for historical references concerning the use of this method in 
the theory of harmonic and subharmonic functions. 

5 Actually, F. Riesz required that ¢, be continuous and g, \, u in the whole domain G, 
while we only require that these conditions be satisfied on B’. The equivalence of the 
two definitions can be seen immediately by the same reasoning which F. Riesz used to 
establish the facts stated in 1.3 above. The wording chosen in our text is due to Brelot, 
loc. cit., p. 17. 
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1.4. If H is continuous in G’ + B’, harmonic in G’, and H = u in G’ + B’, 
then the B. H. M. of win G’ is S H in G’ (R 1, p. 334). 

1.5. Let h be the B. H. M. of u in G’, and define in G a function @ as follows: 

_ jemG — G, 
hin G’. 

Then % is subharmonic in G. ‘To see this, take a subregion G” + B” such that 
G’ + B’CG"’,G"’ + B’ CG. Since u is upper semi-continuous in G, we have 
in G” a sequence of continuous functions g, such that g, \ uin G”’. Define 


wi 


in G” the functions 9; as follows: 


_ Join G” — @’, 
+ \Ai in G’, 


Qo 


where H;, is the solution of the Dirichlet problem for G’ + B’ with the boundary 
condition H, = g, on B’. Then by 1.3 we have @ “SS @in GG”. Thus @ is the 
limit of a decreasing sequence of continuous functions and hence it is upper 
semi-continuous in G”’. As G” was an arbitrary subdomain, @ is upper semi- 
continuous in G. To prove that @ is subharmonic it must be shown that 


2 
u(r,y) Ss 5. I a(x +recosyg,y +rsing)dg 
0 


for every point (2, y) in G and for every sufficiently small r. If (x, y) is either 
in G — (G’ + B’) or in G’, then the inequality is obviously satisfied for small 
values of r. If (z, y) is on B’, we have 


Qr 
aa, y) = ule, ») S$ [ iptreins++hiat 
T J0 


2 
<i | u(x + rcosy, y + rsin¢) dg, 
wT Jo 


since u S @ in G (see 1.2). 

1.6. If u is continuous on B’, then (see 1.3) the B. H. M. of u in G’ is simply 
the solution of the Dirichlet problem for G’ + B’ with the boundary condi- 
tion h = u on B’ (it is assumed that G’ + B’ is a Dirichlet region). 

1.7. In G’, defined as in 1.2, we have a (necessarily unique) least harmonic 
majorant h* (L. H. M.) defined as follows: (a) h* = win G’; (b) if H is harmonic 
and 2u in G’, then H = h* in G’ (see R 2, p. 358). 

1.8. Clearly, h* < h. As stated in the introduction, we shall prove that 
h* = h. Let us observe that this is trivial if u is continuous. Indeed, take 
any point (a, yo) on B’ and a sequence (x, , Yn) — (20, Yo) in G’. Since 
u < h* <h, and u(z,, yn) > ulxo, yo) = A(zo, yo) = lim h(x, , yn), we have 
h*¥ (xn, Yn) — R(tn, Yn) 70. That is, the harmonic function h* — h vanishes 
continuously on the boundary of G’, and consequently h* = h in G’. 
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1.9. The subdomain G’ being given as in 1.2, define in G a function u* as 
follows: 


. uinG — G@’, 
u* = — 
h* inG > 


where h* is the least harmonic majorant of uinG’. Then u* is subharmonic in G. 

We first show that u* is upper semi-continuous in G. Clearly, it is sufficient 
to verify this property for points on B’. Let (xo, yo) be a point on B’. As 
u* < ad (see 1.5 for the definition of @), we have, for (7, , yn) — (20, Yo), 


lim u*(rn, Yn) S lim W(Xn, Yn) S W(ro, Yo) = u(xo, Yo) = Uu*(Xo, Yo). 


The relation 


2r 

u*(x,y) Ss LA I u*(x +rcosg,y +rsing)dy 
2r 0 

is proved exactly as it was proved for @ in 1.5. 

1.10. For r > 0 define 
Ar(z,y) = - II u(x + & y + n) dédn 
rar 

t2+ni<r? 
(see R 2, footnotes on p. 343 and p. 345 for historical references concerning the 
use of this approximating function in the theory of harmonic and subharmonic 
functions). Then A‘’(z, y) is continuous and subharmonic at those points 
of G whose distance from the boundary is greater than r. If we apply the 
same process to A‘ (x, y) twice, using the same r, we obtain a function 
A? (a, y) which has a number of important properties, some of which will be 
listed presently (in this section, wu always stands for a subharmonic function). 

1.11. Take any region G’ + B’ interior to G. Then for r sufficiently small, 
A® is defined and subharmonic in a domain containing G’ + B’, and has 
continuous derivatives of the first and second order there. On G’ + B’, 
AS? \ uforr \ 0. Asa consequence, f AS -+ f u for r > 0, where f stands 
for any simple or double integral taken over any measurable range in G’ + B’ 
(R 2, pp. 342-345). 

1.12. If G’ + B’ is interior to G, then (R 2, p. 353) 





r-—0 


lim I/ AA‘ (a, y)dady < +2, A= . a 
@’+ B’ 


1.13. If G’ + B’ is interior to G, and if u is harmonic in G’, then we have 
AS? = wat every point of G’ whose distance from B’ is larger than 3r, by the 
mean-value property of harmonic functions. Consequently for every closed 
set S in G’ we have a 6 > 0 such that AAS” = 0 on S for r < 6. 

1.14. Consider any bounded domain G’ with boundary B’. By using sub- 
divisions of the plane into smaller and smaller squares, we can obtain in a 
familiar fashion an increasing sequence of nested domains G’, with smooth 
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boundaries B’, which approximate G’ + B’ in the following sense® (the properties 
to be listed are clearly not independent of each other and we state them ex- 
plicitly only for easier reference). 


(a) G, contains a prescribed point (29, yo) in G’. 

(b) G. + BL C Gin. 

(c) G. + BLCG’. 

(d) Any point (x, y) of G’ is contained in some Gj, . 

(e) To every « > 0 there corresponds an m = no(e) with the following property. 
Denote by S, the set of those points of G’ whose distance from B’ is less than e. 
Then for n > no the set S, contains Bi, . 

(f) The measure of G, converges to the measure of G’, and consequently the 
measure of G’ — Gi, converges to zero. 

(g) B’, consists of a finite number of non-intersecting simple closed curves as 
smooth as desired. In particular, Gi. + B°. is a Dirichlet region. 


1.15. We shall need the following well-known fact concerning the dependence 
of the solution of the Dirichlet problem upon the boundary conditions. Sup- 
pose the region G’ + B’ of 1.14 is a Dirichlet region. Denote by F a function 
which is continuous on G’ + B’. Let h denote the solution of the Dirichlet 
problem for G’ + B’ with the boundary condition h = F on B’, and let h, be 
the solution of the Dirichlet problem for G, + B’, with the boundary condi- 
tion h, = F on B;,, where G, + Bi), is a sequence of approximating regions as 
described in 1.14. Then h, — h in the sense that to every 7 > 0 there corre- 
sponds am = no(n) > O such that |h,—h| < 7 in G+ Bi forn>m. We 
sketch the proof for the convenience of the reader. Given » > 0, we have an 
¢ = e(n) > 0 such that 


| h(re, y2) — h(a, y:)| < s} 
} for [(xe — 21)° + (ye — ys)" < ¢, 
| F(z, ws) — Fe, yw)| <3 


for every pair of points (x , y:), (v2 , ye) in G’ + B’, since F and h are uniformly 
continuous in G’ + B’. Take then n > mo(e), where mo(e) is the quantity 
defined in condition (e) of 1.14. Consider any point (x, y) on B.,. By condi- 
tion (e), we have some point (#, g) on B’ such that [(2 — #)? + (y — 9)" < «. 
We obtain then 


| A(z, y) — h,(2, y) | Ss | h(x, y) ad h(z, 9) | + | A(z, 9) ae h, (2, y) | 
= |h(z, y) — h(@, 9) | + | FC@, 9) — Fe, y)| <2. 


6 See for instance O. D. Kellogg, Foundations of Potential Theory, concerning the 
familiar facts in 1.14 and 1.15, and in §2. 
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That is, |h — h,| < non Bi forn > nm = no(e(n)). Ash — h, is harmonic 
in G’, + Bi, it follows that the same inequality holds in Gj, also. 

1.16. The continuity of F was actually used for points of G’ + B’ close to B’. 
Hence the conclusion in 1.15 remains valid if we only know that F is con- 
tinuous in G’ + B’ — S, where S is some closed set in G’. 


2. A remark on the formula of Green.” 2.1. Suppose the functions f, g have 
continuous derivatives of the first and second orders in a domain G’, and that 
these derivatives remain continuous on the boundary B’ of G’. If B’ consists 
of a finite number of sufficiently smooth simple closed curves, then we have 
the classical formula 


(1) II (fag — gAf)dxdy = -/ (1 _ 9 2) ds, 


G 
where n; refers to the interior normal with respect to G’. 

2.2. Let there be given in a domain G a function v with continuous derivatives 
of the first and second orders. Consider a region G’ + B’, interior to G, such 
that B’ consists of a finite number of simple closed smooth curves. In G’, 
take a circle C(x», yo ; r), with centre (ao, yo) and radius r, comprised in G’ 
together with its interior. Put 


) 


‘ os 
v(x, yo) = 5 i v(x + reosg, y + rsin ¢) dg, 
T Jo 


Ux, y) = — log (x — ao)? + (y — yo)"I', (x, y) # (x0, Yo), 
(U(x, y) for [(x — ao)” + (y — yo)" = r, 
L(x, y) = | ° 24 
log r for [(x — x0)” + (y — wo) T Sr. 
Denote by h the solution of the Dirichlet problem for G’ + B’ with the bound- 
ary condition h = von B’. Then (see Kellogg, loc. cit., p. 237, footnote) 


1 al dH 
h(xo, Yo) = on [ o( — az) ds, 


where H denotes the solution of the Dirichlet problem for G’ + B’ with the 
boundary condition H = lon B’ (that is, 1 — H is Green’s function for G’ + B’ 
with pole at (xo, yo)). If B’ is smooth, then the necessary derivatives of H 
will remain continuous on B’.’ 

2.3. We apply now formula (1) first to the region bounded by B’ and 
C(x0, yo; r) for f = v,g = lL, second to the circular disc bounded by C(a, yo ; 1) 
for f = v,g = —logr and third to G’ + B’ for f = v,g = H. Combining the 
resulting equations with the expression for h(xo, yo) in 2.2, we obtain the 
formula 


? The functions 1, 1, , H depend also upon the choice of (xo, yo). However, this point 
will be kept fixed, and there is no need for notations like H(z, y; ro, yo) ete. 
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(r) 1 
(2) v' (x0, yo) = _ // (L(x, y) — H(z, y)| Av(a, y)dxrdy + h(xo, yo). 


2.4. Actually, (2) can be obtained from the classical expression for v in terms 
of Green’s function by an integration under the integral sign. Conversely, 
that classical formula can be obtained from (2) by the passage to the limit 
r— 0. But our idea is to keep r finite in (2), so that we have to deal only 
with continuous functions. We shall need a slight extension of (2) which we 
shall consider presently. 

2.5. If v has continuous derivatives of the first and second orders in G, then 
formula (2) holds for every Dirichlet subregion G’ + B’. 

This may be seen as follows. Approximate G’ + B’ by regions G. + B 
as described in 1.14. Observe first that the functions h, H, as defined in 2.2, 
actually exist for every Dirichlet subregion G’ + B’. Denote by h,, H, the 
functions corresponding to G, + B’,. We have then by 2.2 


r) 1 
(3) wv (x0, yo) = “- [ [ we, y) — H,(a, y)| Av(a, y) drdy + hi(xo, yo). 


We have h,(xo, yo) — A(xe, yo) by 1.15. Next we consider 


// L(x, y)Av(x, y)dady = [/ + EE 


As Ll, and Av are continuous and therefore bounded in G’ + B’, we have 


— 0, 


G'-Gp 


‘ . ’ 
since the measure of G’ — G, converges to zero. Hence 


I/ L(a, y)Av(2, y)dxdy — // L(x, y)Av(a, y) dady. 


Consider finally 


I H(x, y)Av(x, y)drdy = IJ H,,(2, y)Av(x, y) dxdy 


G Gn 


(4) + l/ [H(x, y) — H,(x, y)] Av(z, y) dxdy 


+ I H(z, y)Av(a, y)dady = IS? +12 4 7. 


7 ond 
G'-G, 
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We have JS — 0 because the integrand is bounded and the measure of G’ — Gi, 
converges to zero. Take now any 7 > 0. By 1.16, we shall have for n larger 
than some mp = mo(n) 


|'H — H,| < ninG, + Bi. 


Hence, for large n, 


er i< nf [ \ave, y)|drdy < off | Av(x, y) | dady. 
Ga G’ 


Thus J” — 0. It follows then from (4) that 
I H, (x, y)Av(x, y) dxdy — l/ H(z, y)Av(x, y) dady. 
Gs G’ 


Summing up, formula (3) yields, for n — «, formula (2) for the most general 
Dirichlet region G’ + B’ C G. The assumptions concerning the smoothness 
of v could be generalized in an obvious way, but this is immaterial for our 
purposes. 

2.6. If the function v of 2.5 is subharmonic in G, then the function h in 
formula (2) is the B. H. M. of v in G’ (observe that v is continuous by assump- 
tion, and compare 2.2 and 1.6). 


3. A lemma. 3.1. Lemma. Let u be subharmonic in a domain G. Denote 
by G’ + B’ a Dirichlet region interior to G, and suppose that u is harmonic in G’. 
Then u = hin G’, where h is the B. H. M. of u in G’. 

3.2. To prove this lemma, consider the sequence of approximating functions 


4 (3) 


un(r, y) = Ai), (2, y) n>wN 


(see 1.10), where N is large enough so that for n > N the function u, is defined 
in some domain which contains G’ + B’ in its interior. Denote by h, the 
B. H. M. of u, in G’. Since u, \ u, and since u, is continuous, we have (see 
1.2, 1.3) 


(5) h,—hinG@, 
where h is the B. H. M. of u in G’. Also (see 1.12) 


(6) O< Au, (x, y)drdy < M, 


G'+B’ 


where M is some finite constant. In G’, take a point (zo, yo) and a circle 
C(x, yo; r) with centre (xo, yo) and radius r, such that C(x», yo; r) is comprised 
in G’ together with its interior. On account of 2.5, 2.6 we have then (since u, 
has continuous derivatives of the first and second order) 








— “| TF) 
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Un (xo, Yo) = -5 | [we y) — H(a, y)lAun(x, y)dardy + ha(xo, yo) 
(7) “Thiet 
= 5a In a h,(2o, Yo). 


3.3. We write 


(8) I, = [ [we y) — L(z, y)] Aun(a, y)dady = [| + [I 


G? 


where S is a closed set interior to G’. Observe that l, = 1 = H on B’ (see 2.2). 
That is, H — l. = 0on B’. As l,, H are continuous in G’ + B’, we have for 
every « > 0a6 > O such that | H(z, y) — l(a, y) | < ¢ for every point (z, y) 
in G’ whose distance from B’ is less than 6. Choose the closed set S of formula 
(8) as the set of those points of G’ whose distances from B’ are 25. In formula 
(8) we have then, by (6) in 3.2, 


(9) Jy s ef Au, (2, y)dady S « I/ Au,(z, y)dady < «M3 
| G’-s G’-—s G'+B’ 


On the other hand, for n large enough we have Au, = 0 on S (see 1.13). Hence 
in formula (8) we have / | = 0 for large n. That is, 





lim | J, | < «M. 


As ¢ is arbitrary and M is fixed, it follows that J, — 0. 

3.4. By 1.11, 3.3 and by formula (5) it follows from formula (7) that 
u (xo, yo) = A(xo, yo). But u is harmonic in G’ and therefore u‘” (xo, yo) = 
u(zo, yo). Thus u(x, yo) = A(xo, yo), and the lemma is proved. 


4. The theorem. THeorem. Let u be subharmonic in a domain G. Let 
G’ + B’ be a Dirichlet region interior toG. Denote by h and h* the best harmonic 
majorant and the least harmonic majorant of u in G’ (see 1.2 and 1.7). Then 
h = h*. 

Proof. Define 

yt = UU inG — G’, 
~ \h* in G’. 


Then u* is subharmonic in G (see 1.9). Denote by h* the B. H. M. of u* in G’. 
Since u and u* are both subharmonic in G and u = u* on B’, we have (see 1.3) 
h* = hin G’. Since u* is harmonic in G’, we have (see 3.1) u* = h* in G@’. 
But by definition u* = h* in G’. Hence h* = h, as stated in the theorem. 


8 Observe that u, is subharmonic and therefore Au, 2 0 (R 1, p. 335). 
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5. Conclusion. 5.1. Let u be subharmonic in a domain G. Denote by 
G’ + B’ a region interior to G, and by h’ a harmonic function in G’. Define 
in G a function wu’ as follows: 


If u’ is subharmonic in G, we shall say that the harmonic function h’ is admissible 
for u in G’. 

5.2. We have then the following 

Tueorem. If u is subharmonic in a domain G, and if G’ + B’ is a Dirichlet 
region interior to G, then there exists in G’ exactly one harmonic function which is 
admissible in G’ for u in the sense of 5.1. 

Indeed, denote by h the B. H. M. of u in G’. Then h is admissible for u 
in G’, by 1.5. Let h’ be any harmonic function which is admissible for u in G@’, 
and consider the corresponding function wu’, as defined in 5.1. Let h’ be the 
B. H. M. of wu’ in G’. Since u and w’ are subharmonic in G and u = wu’ on B’, 
we have h = h’ in G’, by 1.3. Since w’ is harmonic in G’, we have h’ = u’ in G’ 
by 3.1. By definition u’ = h’ in G’. Hence h’ = h in G’, and the theorem is 
proved. 

5.3. This theorem implies that the best harmonic majorant can be charac- 
terized as follows: If G’ + B’ is a Dirichlet region interior to G, then, as is 
obvious from 1.5 and 5.2, the best harmonic majorant of u in G’ can be defined 
as the unique harmonic function h with the property that the function a, equal to u 
in G — G’ and to h in G’, is again subharmonic in G. 
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SUMMABILITY OF DOUBLE FOURIER SERIES 
By J. J. GERGEN 
Part I 


1.1. Introduction. We consider in Part I the M. Riesz and the Cesaro sums 
of a simple series 


(1.11) yo ap, 
p=0 
that is, the sums 


ioeLe-we, tin De. 


p<z p=0 


where, in the Cesaro sums, A> is the coefficient of x” in the expansion 


Q—2)**=14 > Az 2’. 


p=1 


Riesz’s theorem that for 0 < a@ the existence of either of the limits 


lim S,(m)/A% = L, lim o,(z)/zx* = L 
implies that of the other is of course well known.’ In Theorems I and II below 
we give some formulas expressing in a simple manner each of the sums in terms 
of the other. On the basis of these theorems Riesz’s result can readily be ob- 
tained. The derivations do not in general follow Riesz’s procedure; and as 
regards conciseness there seems to be some advantage in following this second 
method of approach. Theorems I and II can also be used to advantage to 
simplify the extensions to two variables of Riesz’s theorem given by Dr. S. B. 
Littauer and the author.’ In Part II of the present paper we shall apply these 
theorems in connection with double Fourier series. 

THeorEM I. Let 0 < a. Let k be the integral part of a. Let f(x) be con- 
tinuous with its derivatives f’, f’, --- ,f“*? for 0 < x, and satisfy 

$0) =f@ =---=f*"@ =90, 
f(z) = Tia + 2)/{T(z) (a + 1)} forl Sz. 

Received December 26, 1936; presented to the American Mathematical Society, April 11, 
1936. 

1 For a presentation of Riesz’s proof see Hobson, Theory of Functions of a Real Variable, 
vol. 2, 1926, pp. 90-98. 

2 J. J. Gergen and S. B. Littauer, Continuity and summability for double Fourier series, 


Transactions of the American Mathematical Society, vol. 38 (1935), pp. 401-435. This 
paper will be denoted by A. 
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Then the fractional derivative U(x), 
U(x) = f(x) = 1/T(k + 1 — a) [ (x — t)*f*" Oat, 
0 


of order « + 1 of f, is continuous for 0 < x, and we have 


(Cr m+1 
(1.13) S.(m) = | U(m + 1 — t)o,(t)dt - form =0,1,---, 
0 
(1.14) U(x) = O(a”), I 0, 
(1.15) U(x) = O(2****), r3> +0. 
THEorREM II. Let0 <a. Let 
(1.16) voz) = T(@+1) 2 AZ. 
p<z 


Then the fractional integral V(x), 
V(x) = ¢a(x) = 1/T (a) [ (x — t)** golt)dt, 


of order a of go, is continuous for 0 < x, and we have 


(1.17) oa(z) = 2) V(x — p)Sa(p) for 0 < z, 
(1.18) V(x) = O(2”), I @, 
(1.19) V(x) = O(z*), z—+ 0. 


2.1. Lemma for Theorems I and II. Both Theorems I and II depend on 
the following lemma. To simplify the writing we shall suppose in what follows 
that zx, y are positive numbers, and that m, n, p, q are integers, positive or 0. 
We denote by M a number independent of m, n, p, q, z, y over the range 


0< 42, 0<y; 0< ™m, 0Osn, 0S ?p, 04, 


or that part of this range specified. The symbols o, O refer to the behavior of 
the function in question in the neighborhood of ~ or (x, ~). 
Lemma 1. Let 0 < 6 < 1. Let F(x) be integrable over every finite interval 
0 <2 < mand satisfy 
F(z) = 2” *"/T(p — 8) + O@”**) 


forp = 0,1,2. Then 


Fi(x) = O(x7”). 
We have for each fixed b, 0 < b < 1, 


1'(6)F;(z) = «° | [ + I | (1 — t)*"Fo(tr)dt 
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( 


= (1 — b)* "x" F,(br) + (6 — 1)(1 — b)* *x °F (bz) 


+ (6 — 1)(6 — 2)r” | (1 — t)’ *F.(tr)dt + I (li — nt (teat 


= E(b)r* + O(x”), 


where 


r(1 — 6)E(b) = (1 — b)*"b* — (1 — Bb) 


b 1 
— (8 — 2) I a — sa —3 [ a — @ "rds. 
0 b 


Now, differentiating, we see that E’ vanishes for0 <b <1. Hence, using the 


expansion 





with z = 1,z = 1 — 6, we have 


r(1 — 8)E(b) = lim {a — b)"s* -—3 [ (a — >t “ath 


=1-— sf {a —#** — 1} dt 


= 1 — 5{1 + (1 — 8)/2! + (1 — 8)(2 — 8)/3!4+---} =0. 


The lemma follows. 


3.1. Proof of Theorem I. The continuity of f“*” implies the continuity of 
f‘**” and the truth of (1.15). On the other hand, U satisfies 


f(z) = 1/T(a+}) [ (x — t)*U (at 


because of the first half of (1.12). Hence 


T 
M 


o(t)U(m + 1 — ddt 


p=0 


m+1—p 
a, | (m+ 1 — p— u)*U(u)du 


ra + 1) Ys(m +1-— ps, = S(m). 


Accordingly, it is enough to prove that (1.14) holds. 
If a is an integer, then f reduces to a polynomial of degree afor1 < x. Hence 


U(r) = f** (2) = 0 forl <x. 
This gives (1.14) in this case. 
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Suppose then that a is not an integer, that a = k + 1 — 6, where 0 < 6 < 1. 


. * 45 3 
From Stirling’s formula 


log (x) = (x — 4) log x — x + 3} log2a + P(O)/zx — i P(t)/(t + x)*dt, 


P(t) = Dd {cos 2 nat} /{2 n°x*}, 
n=1 


it follows readily that 


a H(z) 


(a + 1)f(x) = x%e"™, 

where 
DS? H = O(z”’). 
Hence, 
T(a + 1)D”f = 2*”/T(a + 1 — p) + O(2*””’). 

Taking now in Lemma 1 

uz) = Ma + 1f***(2), 
we see that F; coincides with T'(a + 1)U and that 

F,(z) = Pla + 1D" f(z) = 2” */T(@ — 8) + O@”**) 

for p = 0,1, 2. Thus (1.14) holds, and this completes the proof. 


4.1. Proof of Theorem II. We have 
(4.11) galt) = 2) A,” (x — p)*. 


p<e 


The continuity of V and the truth of (1.19) then follows. In addition, 


p 


a(t) = Do (x — p)* 2) AD%y’ Salq) = »» V(x — p) S.(p). 


p<z q=0 
It remains then to consider (1.18). 
Let A,x" denote the q-th difference 


(4.12) Ax” = > A," (z — p)”. 
p=0 


Then, for 1 S q S n, 
z ty tq-1 

Ax" = n!/(n — q) 7 at, | dt, +++ I to * dt,. 
z-1 ty-1 tg-i-1 


| Az" | Ss n! 2” * forl Sq 2, qin, 


Hence 


3 See, for.example, Bieberbach, Lehrbuch der Funktionentheorie, vol. 1, 1923, pp. 297-308, 
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4,2" = n}, Aauiz” = 0. 
Now if @ is an integer, we have 
V(z) = Aeuz* = 0 
fora + 1 < 2; and this is (1.18) in this case. 


Suppose then that @ is not an integer, that a = k + 6, where 0 < 6 < 1. 
Denoting by u the largest integer less than z, we have 


3 
é 
z a 
¥ 
yt 
: 


go(x)/T(a + 1) = — = xz *"/T(—a) + O(x-*). 


In addition, for each fixed positive n, 


¢n(x)/T(a + 1) 


n—l 
A5*;*** + 1/al 2, Ast *** als — 2 + @)* 


q=0 


. en 


n—l 
_ 20) 40(e78 4 Be) 


q=0 


= 2" *"/T(n — a) + O(2"™”). 
Taking 

Fo(z) = ox(z)/T(a + 1), 
we see that F; coincides with ¢./T'(@ + 1), and that 


: F,(z) = ¢n+p(x)/Tla a 1) = a '(p as 6) + O(2”**) 

: for p = 0,1, 2. The lemma is then applicable and the theorem follows. 
f Part II 

i 


5.1. Introduction. We consider here a function f(u, v) integrable L over 
(0, 0; 2x, 27) and with period 27 in each variable. We shall suppose in addition 





j that f is even in each variable, and shall consider only the Fourier series of f 
| at the origin, that is, the series 
HH (5.11) a 
t m, n=0 
i 
where 


Sasa ™ Agha [ cos mu aw | cos nu f(u, v) do, 
0 0 
Ayo = 1/z, Ar = Ae = +++ = Qo. 


These simplifications involve no loss in generality. Our object is to complete 
some of the results on Cesaro and Riesz summability of the series (5.11) given 
in A. We are interested in particular in the question whether the rdles of sum- 
mability and continuity can be interchanged in the following Theorem VI of A.‘ 


‘ This question is raised on p. 415. 
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THEeoREM VI. Suppose that 
OSa<a- 2, Os B<b-2 


and that the series (5.11) is summable either (C; a, 8) or (R; a, 8) to sums. Then, 
if f is bounded (C; a, b) in the square (+0, +0; 6, 6) for some 6 > 0, it follows 
that f is continuous (C; a, b) with limit s.° 

5.2. In connection with Theorem VI it might be noted here, incidentally, 
that it carries with it the following permanency theorem. 

THeoreM III. Suppose that a, b, a, 8 are non-negative numbers, that f is 
continuous (C; a, b) with limit s, and that the series (5.11) is summable either 
(R; a, B) or (C; a, 8) to sum S; then 


, « 


(5.21) s= 


> 


The proof is immediate. We set 
E=a+a-+3, n=6+6+3. 


Then, since f is continuous (C; a, b) with limit s, it is continuous (C; &, 7) with 
limit s.° On the other hand, since f is continuous (C; &, 7), it is bounded (C; £, n) 
in some square (+0, +0; 6, 6), 0 < 6. Thus, making use of our summability 
hypothesis and Theorem VI, we see that f is continuous (C; &, 7) with limit S. 
We conclude that (5.21) holds. 

5.3. We turn now to the question on Theorem VI. The result that one 
might expect would be that continuity (C; a, b) of f, plus ultimate boundedness, 
either (C; a, 8) or (R; a, 8), of the series (5.11), implies summability of the 
same type and order of the series for a, 8 sufficiently large. Peculiarly enough, 
this is false. This we shall show in paragraphs 6.1 through 7.1, the result of 
which we state in the form of 

THEOREM IV. Corresponding to every set of non-negative numbers a, b, a, 8 
there can be constructed a function f(u, v) integrable L over (0, 0; 2, 3), even and 
periodic with period 2x in each variable, and continuous (C; a, b), for which the 
series (5.11) is ultimately bounded both (R; a, 8) and (C; a, B), but is summable 
neither (R; a, B) nor (C; a, B). 

Actually we shall prove somewhat more than is stated. The function f we 
construct will vanish on (0, 0; 32, 7). In addition, the series (5.11) will not 
only be ultimately bounded but bounded both (R; a, 8) and (C; a, 8). 

5.4. The natural analogue of Theorem VI being false, one might ask if it is 
possible to obtain any result of the same general character. In our final the- 
orem, the proof of which is in paragraphs (8.1) through (9.1), by requiring ul- 
timate boundedness of two different orders, we obtain a result of this kind. 


5 The definitions for R and C summability can be found on pp. 401-402 of A, those con- 
cerning the continuity and boundedness of f, on p. 413. 
* This follows directly from Theorem ITI, p. 413 of A. 
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THEOREM V. Suppose that 
(5.41) OSa<é§, Osa<éi-l, 0sb<>n, 0=B<7n-1 


and that f is continuous (C; a, b) with limit s. Then, if the series (5.11) is ulti- 
mately bounded both (R; a, n) and (R; &, B), it is summable (R; £, 7) to sum s. 
In addition, the corresponding result with C in place of R holds. 


6.1. Lemmas for Theorem IV. 
Lemma 2. For every m there exists an even, bounded and measurable function 
I ,,(u), with period 2x, which vanishes on (0, 4x), and whose Fourier series, 
wo 


Tn(u) ~ Cy, m COS pu, 
0 


p= 
satisfies 
Co, n = Ci, — “°° = Cam = 0, C'n+1, m a 0. 


To prove the lemma it is enough to show that, m being fixed, we can choose 
constants bo, bi, «++ , bm, so that 


m+1 


(6.11) DX Ap.a be = Vor, 
q=0 
according as p S mor p = m + 1, where 


A».q = / cos pu cos qu du. 


For, with the b’s so chosen, the even, periodic function J,, which vanishes on 
(0, $x) and coincides with 

m+1 

> b, cos qu 

q=0 
for 34 < u S x has the desired properties. 

Now a short calculation shows that A,,, is rational for p # q and has the 
value 1/(2A,) for p = q. Thus the determinant A of the A’s can be written 
in the form 

A= ae a + ne + oe + Fase, 
where the r’s are rational. Since 7 is not algebraic, it follows that A is not 0. 
Accordingly the equations (6.11) admit a solution, and the lemma follows. 
6.2. Lemma 3. Let 


(6.21) 0< tit << & = 7; €p/Cpi1 > ®, p- ®. 


Let ¢,(v) be the even, periodic function which coincides with v for e)4, < v < ep 
and equals 0 elsewhere on (0, 7). Let Rg(y; ¢p) be the Riesz mean, 


n<y 


Ray; gr) = Lo (1 — y/n)? rn [ cos nv gp(v) dv, 
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of order B, 0 & 8, of the Fourier series of ¢, at the origin. Then i 
(6.22) | Rely; >) | < M/ebss, 
and, for every positive &,' : 





(6.23) ef, Rs(E/epi13 Gp) > 2 h/x [ yasi(v)/v" dv, p—®. 


Suppose first that 0 < 8. We have 


(6.24) Rs(y; ¢p) = 2y/m [ var (yr) o,(v) dv 
0 
= 2y/m || + / | Ya41 Gp dv 
= F\(y) + Fily), 
say. 


Applying the second mean value theorem, we get 


" 


| Fil = | 2y/(m ebss) ya4i(yv) dv (Cpa SS ep) 


ept 
< M/ebus [ eld to <6 Mepehsa. 
0 


In addition, since e,/epi1 — ©, we have 


2 


fep/ep+1 , 
ear Fi(t/epy:) = 2 /m I yeu(v)/v' dv > 22 /x | yasi(v)/v' dv. 


On the other hand, for 1 < y, 


Fa |< My / Teer 


< M(iy*+y"') i (v?" + v*) gp(v) dv < M. 


We conclude that (6.22) holds for 1 S y, and that (6.23) holds. Since Rg(y; ¢,) 
= Rs (1; ¢,) for y < 1, the lemma follows for 0 < 8. 

The proof for the case 8 = 0 is similar and can be omitted. In this case the 
formula 


e 


Roly; gp) = 2n/x I yi(nv) /v'dv | 


ep+1 


ep 
+ x [ [sin nv {cot v/2 — 2/v} + cos nv]/v' dv, 


pti 
where n is the largest integer less than y, takes the place of (6.24). 


7 For the definition, properties and references concerning y,,,, see A, p. 416. 
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7.1. Proof of Theorem IV. Turning now to the proof of the theorem, we first 
select an infinite sequence of functions {#,(uy}, n = 0, 1, --- on the basis of 
Lemma 1. 


In the first step we set m) = 0 and choose % so that #9 has the properties 
specified for J,, and satisfies ; 


|| <1, to = max | R,(z; %o) | < 1. 
0<z 
In the second step, noting that 0 < & and that 
R.(x; &o) — 0, t-—> @, 
since yp vanishes for | u | < 32, we choose an integer np < m, so that 
| Ra(x; Bo) |/to < 3 


form +123 2. We then choose 4; so that #, has the properties specified for J,, 
and satisfies 


|| <1, 4 = max | Ra(z; #1) | < $b. 
Continuing this process, in the (p + 1)-th step we select n,. < n, so that 
(7.11) x | Ra(x;@q) |/tg <2” for no +172. 
We then choose #, so as to have the properties specified for I, and satisfy 
|o,| <1, tp = max | Ra(x; ®,) | < tp1/2”. 


The function f(u, v) we now define as 


J(u, 0) = 3 ,(u) 99(0), 


where the g’s are the functions of Lemma 3 with 
eo = TF, = &, @ = t, 
Since 0 < tp41 < t,/2””’ < 4, these e’s satisfy (6.21). On the other hand, 


®,, gp are even, periodic and integrable, and 


®,(u) = 0, > | &,(u) dp(v) | < 0, 


the former for 0 S u S }z, the latter on (0,0;2, 7). Hence f has all the proper- 
ties asserted save perhaps those concerning summability. 
We consider the Riesz mean 
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Re, a(t, y) = Do (1 — m/z)* DS (1 — n/y)* any n- 


m<z. n<y 
We shall show that R., 3 is bounded for 0 < x, 0 < y and that 
(7.12) lim | Ra,.s| > 0. 


(z, y)—*(, ©) 
From these two facts the theorem follows. For first, by Theorem II of A, 
boundedness (R; a, 8) implies both boundedness (C; a, 8) and the equivalence 
of summability (2; a, 8) with sum s to summability (C; a, 8) with sum s. And, 
secondly, by Theorem III, since f is continuous (C; 0, 0) with limit 0, the series 
(5.11) cannot be summable at all unless it is summable to 0. 
Now 


Ra, a(t, y) = > Ra(x; Bg) Raly; vo): 


In addition, for z © np41 + 1, p SQ, 
R(x; ,) = 0; 


and accordingly, for x S nyu. + 1, 


Pp 
Ras = > R(x; &,) Raly; ¢q)- 
& 


Now, form + 1S n,+ 1 S 2,0 < y, we have 


p-l p-l 
2, Ral; &q) Roly ee) | <M Do | Bala; %,) |/ta < M/2’ 
| eo | q= 


by (6.22) and (7.11). On the other hand, for0 < p,0 < 2,0 < y, 
| Ra(z; Pp) Raly; vp) | < tpM/tp = M. 


We conclude that R,,3 is bounded for 0 < z,0 < y. In addition, choosing z, 
so that 


t, = R.(2p ; ?,), 
we have 
Np +12 S ny + 1. 
Hence, using (6.23) and choosing é so that 
/ ves (v)/v' dv ¥ 0, 
é 
we have 


iim | Ra, s(t, y) | = lim | Ra(x,; &,) RelE/t3,; gp) | 
p72 


(2, y) 7 (e, @ 
] x l 
= 22/5 I ysi(v)/v' dv | ¥ 0. 
g ' 


This is (7.12) and completes the proof. 
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8.1. Lemmas for Theorem V. 

Lemma 4. Let0 <a <a-—1. Let g(u) be integrable over (0, +), even, and 
with period 2x. Let (1.11) be the Fourier series of ¢ at the origin, and let it be 
bounded (C, a) or (R, a). (The two are equivalent.) Let mo be an arbitrary posi- 
tive integer. Then we can write 


galt) = Do gp(x) ay + gmg(x), 


p<mg 
where the g’s are measurable for 0 < x, go, «++ ,Gmo-1 are independent of y, and 
| gp(x) | < M* 2x" for p < mo, | gm (x) | S M* 2x* Lub. | o.(u)/u"|, 


mos 
M* being independent of ¢ as well as x and p. In addition, an expansion of the 
same type is valid with 


| gmo(x) | S M*x* Lub. | Sa(m)/m* |. 


mosm 


Denoting by ¥. the number and H(u) the function of paragraph 9.1 of A, 
and applying Lemma 11 of A, we have 


galt) = vat" [ H(xu) og(u) du 


-_ \ 3» {vent [ H(xu)(u — p)* au} a, + vaste | H(xu)o,(u) du 


P<mo 


= ~ 9p(x) ap + GYmg(X), 


p<mo 


say. Now by Lemma 10 of A, 


/ | H(xu) (u — p)*| du < a2" [ u* | H(u) | du < M*/2*"", 
Pp 0 


0 mos 


[ "1 H(u) oe(u) | du $ 27° Lub. | oe(u)/u* |. 


Thus, since H(xu)(u — p)*, H(xu)o.(u) are superficially measurable, the first 
part of the theorem holds. 
To obtain the second part we use Theorem II. We have 


er) = D> {vant / : H(2xu)V(u — p) au} S.(p) 


p<mo 


+ bes [ H(zu) X V(u— p)Se(p) du; 


0 mosp<u 


and the second part follows from the properties of V and the definition of S,(p). 
8.2. Lemma 5. If0 S a < £,0 S B < n, and if the series (5.11) is ultimately 
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bounded both (R; a, n) and (R; &, 8), it is ultimately bounded (R; §, n). In addi- 
tion, the corresponding result with C in place of R holds.* 
Consider the Riesz sum 


O%, (2, y) = p (z - p)* i (y - q)" Qp,q- 


p<z q<y 


Under our hypotheses there is a positive mp such that 


IIA 
cs 


low.a(t, y)| < Mx*y’, — | o¢,a(z,y)| < Mz'y’, — for mo < x, mo 
Now, for mp S x, m S y we have 


T(a@ + 1)T(E — a@)oz,,/TE + 1) = [ i Oa, a(t, y) dt 


= >. " (2 — t)**" (t — p)*dt yi (y — q)’ Gp, ¢ 
(8.21) »<mo Jp _ 


+ [ (x — t)**"* oa, a(t, y) at 


say, and 
(8.22) jo? |< May’, 


(2) | 


|e | <Mlonl+leo 


<M > (y — g)"*" (t — g)? dt DS (x — p)*| apg | + Marty’ 
(8.23) econ Je ome 


< F(z) y'’, 
where F is independent of y. But 
mo ( 
o” aty” = » et | (x — yo (t — p)* ix \2 (1 — g/y)’ ay, : 
p<mo P q<y 


is a sum of the form considered by Agnew in his lemma on double series,’ and 
mo 
i i (x — t)**" (t — p)* dt = o(1). 
Pp 


We conclude from the lemma and (8.23) that o”” satisfies (8.22). The proof 
is then complete for Riesz sums. 

The proof for Cesaro sums is analogous, the formula taking the place of (8.21) 
being 


8 The fact that (5.11) is a Fourier series is of no importance. 

® For a statement of and references to Agnew’s lemma see A, pp. 402 and 409. As stated 
in A, Agnew considered the case in which z, y are integral variables. The immediate 
extension to continuous variables is given on p. 409 of A. 
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Sz, (m,n) = >, A&-%"S,,, (p, n). 
p=0 
9.1. Proof of Theorem V. Since continuity (C; a, b) implies continuity 
(C; a’, b’) for a < a’, b < b’, we can suppose that 
OSa<a-1<t-1S5h O58 <b-1<n-1Kk, 


where h is the integral part of a, k the integral part of b. Also, for the usual 
reasons, we can assume that s is 0. 


Consider then o;,,. Let 0 < «€ be arbitrary. Let 0 < 6 be chosen so that 


| fa,o(x, y) | < ex*y’ forz S b,y S 6. 


Let ¥, Ya, % be the numbers and H(u), K(v) the functions of paragraph 7.2 of 
A with é in place of a and 7 in place of 8. Then we have, by Lemma 9 of A, 


Etat]  nt+b+l 
Cr, =r y" [ 


(0, 0; ©, 2% 


“Leesa Lan hase * ave 
(0, 0; 8, 3) (8, 8; 2, 2) (8, 0; 2, 2) (0, 8; 8, 2) 


yartett yt! HK f, 4 d(u, v) 


H(xu) K(yv) fa,o(u, v) d(u, v) 
) 


4 
- 2% 


i=l 


say. Now by Lemma 6 of A, 
(9.11) |n| < Naty’ I | H (u) | u* du [ | K(v) | v° dv < Na‘y’e, 
0 0 


where N is independent of ¢, x, y. Next, since 1 < a, 1 < b, we have 


| fa.o(x, y) | < Ma*y’ ford Saz,5 Sy 


and thus 


(9.12) lre| < aay’ [ yet au | vo” dv < Mz’y’. 
ry 


It is enough then to show in the case of R summability that rs; = o(z‘y"), r= 


o(x*y"). Moreover, since the situation is symmetrical we can confine ourselves 
to rs. 


We first apply Lemma 5, choosing a positive integer mp» so that 
| oe.e(, y)| < Ma'y’ for m < x, m S y. 


Then we note that 
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Irs] Sfloneltinl+ inl}+ ln] 
= o| atv +/y"" i : Ki) { Ee — p)*r, 


p<z 


(9.13) [ cos pu fo,o(u, v) in} dv | 


=O {ay oy / | K(yv) | iw [ | fo, o(u, v) au} 


= 0 (x**ty’). 
Next, we write 
ry = per**** | H(xu) {x (y — q)" | cos qu fa, o(u, v) av} du. 
5 a<u 0 
Setting 
elu, y) = Lily ara [cos au, 0) ay 
a<u 0 
we have, for a fixed y, 
gat, y) = Dy (y— 4)" mf cos qv fa,o(u, v) dv. 
qa<y 
In addition, ¢ is an even periodic function of u. Its Fourier series at the origin is 
- Xp {= (y — q)" Aq Gp, . 
p=0 a<y 
which, for m < y, is bounded (R, a). Accordingly, using Lemma 4, we can 
write 


valu, y) = Dd gp(u)  (y — 4)" Acar. g + 9(u,y) form Sy, 
q<y 


p<mo 
where the g’s are measurable for 0 < u, go, +++ , Ym o-1 are independent of y, and 
| gp(x) | < Mzx* for p < mo, | g(x, y) | < Ma*y" for m & y. 
Thus, 


ta— Do var [ H(axu) gp(u) du » (y — q)" Ada, @ 


p<mo 


=0 {te | [ g(u, y) H(xw du \ 
6 


= O(zr"y") = o(a’y’). 


But the sum 
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- {ver" ‘ H(xu)gp(u) au) {x (1 — g/y)" Nee. 
(9.14) p<mo 8 a<u 


= rex ‘y" + o(1) 


is again one to which Agnew’s lemma is applicable. Since 
Wer” I H(au)g,(u) du = O(2**) = o(1), 
3 


we conclude, on applying (9.13) and the lemma, that the sum is o(1). From 
(9.14) we then see that r; = o(x*y’), and this completes the proof for Riesz 
sums. 

The proof for Cesaro sums is of the same type but slightly more complicated. 
We first choose f, f so as to satisfy the conditions imposed upon f in Theorem I 
with a replaced by &, ». Then we have 

n+l 


m+1 
Sz, (m,n) = [ ~U(m+i1—- naz [ U(n + 1 — y) o¢,9(z, y) dy, 
0 0 


where U, U are the fractional derivatives f**, f°*”. Letting «, 5, r; have the 
same significance as above, we write 


4 
Si.» = ~ 


i=1 


m+1 n+ 
0 


1 4 
Um +1 — adr f On+1—y)rndy= DS, 
0 i=] 


say. 


We see readily by (9.11) that 
mt+1 n+l 
[Si] < Nom + Dnt are [~ | OG) fae [| Oa) | a 
0 0 


< N(m + 1)§ (n + 1)" «, 
where N is independent of m,n, «. In addition, we have, by (9.12), 
| Se | < M(m + 1)* (n + 1)’ = o(m‘n’). 


Accordingly, because of the symmetry, it is enough to show that S; = o(m*n”). 
We first apply Lemma 5, choosing a positive integer mo so that 


| Se. | < Mmi'n’ for mo < m, mM 


IIA 


n. 


Then we note that 
|rs| S M(x + iy" [ | K(yv) | dv [ | fo.o(u, v) | du 


< M(z + 1)" y’, 
and, accordingly, that 
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m+1 
8 = Of] Se.) +151 + 51+ [ | U(m+1—2)/\dzx 


ss n+1 
(9.15) [ |U(n+1—y)rs| dy 


= O{m*** n’}. 
Next, we write 
n+l _ C) 
[ U(n+1—y)rsdy = vation | H(xu) 
0 5 


q=0 


and reasoning as with r; we see that 


n+l x n 
[ Urs dy — >» vor? | H(au)g>(u) du p 3 At—aNq Gp, ¢ 
0 6 q=0 


p<mg 
= per | H(2xu) g*(u, n) du, 
r) 
where the g*’s have properties analogous to the g’s. Thus, for m) S n, 


h- 2 {ve ia U(m + 1 — a)a**** az | H(xu)g>(u) in} 


pP<mo 


( n 
\2 Aj-ahe Ap.¢@ 


q=0 


m+1 C) 
vf U(m + 1 — 2)a****" az [ H(xu)g*(u, n) du 
0 6 


m+l1 C-} 
on | |U | 2* dx | oe au) O(m*n"). 
0 3 


Making use of (9.15) and applying Agnew’s lemma to the sum 


m+1 ” — 
> {mr [ U gitett dr i Hg. in} (= Alan "Ng Ap, 0, 
p<mo 0 - 


we conclude that S; = o(m‘n”), and this completes the proof. 
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STRUCTURES AND GROUP THEORY. I 
By OysTEIN ORE 


In a recent paper on the foundations of abstract algebra’ I have shown that 
the principal results on algebraic domains are not primarily to be considered 
as properties of the elements of the domain itself but as properties of certain 
systems of distinguished subsets, like systems of subgroups, ideals, submoduli, 
etc. These systems of subsets have the common characteristic property that 
they form a structure, i.e., a system in which union and cross-cut of two elements 
are defined. The theorems on algebraic domains are shown to be theorems on 
structures. This explains the well-known similarity of several algebraic theories 
and makes possible a unified structural theory applicable to all systems: 

After this common foundation for the algebraic theories has been established, 
it is, however, not difficult to see that the various algebraic domains like fields, 
rings and groups have peculiar structural properties of their own. In certain 
cases it is even possible to characterize the domains by these properties. 

In the following we shall apply the principles of the theory of structures to the 
foundation of the theory of groups, that is, we base the theory as far as possible 
directly upon the properties of subgroups and eliminate the elements from 
theorems and proofs. This entails a certain simplification. More important, 
however, is the fact that this method, even in the elementary theory of groups 
which we consider in this paper, leads to new systematic points of view and 
interesting new results. 

In Chapter I we discuss the structure formed by all subgroups of a given 
group and indicate the general principle of duality. Furthermore, in the theory 
of structures we have constructed a quotient structure for any structure, while 
quotient systems A/B in groups have been defined only when B is normal in A. 
Hence we are led to the introduction of quotient systems for all subgroups. 
The algebraic system A/B is then a multi-group differing from ordinary groups 
only in the property that the product is not unique. 

In Chapter II we consider the law of isomorphism. When the assumption of 
isomorphism is weakened to co-set correspondence we are led to permutable 
groups. Such groups have the structural property expressed in Theorem 6. 
When structure isomorphism is required, one is led to the important type of 
subgroups which I have called quasi-normal subgroups. 

Some of the principal new results are to be found in Chapter III. The 


Received February 10, 1937. 

1 Oystein Ore, On the foundation of abstract algebra I, Annals of Math., vol. 36 (1935), 
pp. 406-437; II, ibid., vol. 37 (1936), pp. 265-292. These two papers will be cited as Founda- 
tions I and IT. 
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theorems of Jordan-Hélder and Schreier-Zassenhaus are analysed and several 
extensions of these theorems are obtained. When only index relations are 
required certain weak permutability conditions of the chains are required. 
When structure isomorphism is wanted we have to consider chains in which 
every term is quasi-normal in the preceding. For such chains there exists a 
complete analogue of the theorem of Jordan-Hélder. 

The last chapter deals with the Dedekind structure formed by all normal 
subgroups. Here some of the results may be taken over directly from the theory 
of structures. The concept of direct similarity is shown to be equivalent to 
central isomorphism. The well-known duality between the center and the 
anti-center, i.e., the quotient group with respect to the commutator group is 
easily explained by the general principle of duality. Among the further results 
of this chapter I shall only mention the interesting self-dual Theorem 4. 


Chapter 1. Quotient systems 


1. Group structures.” The system of all subgroups of a given group G forms 
a structure 2. To any two given subgroups A and B of G there exists a cross-cut 
(A, B) consisting of the common elements of A and B, and a union [A, B], which 
is the subgroup generated by A and B. Since this structure has the property 
that any finite or infinite set of subgroups has a cross-cut and union, we shall 
say the structure = of all subgroups of a given group G is closed. 

The cross-cut and union satisfy the ordinary axioms for these operations: 


(A, B) = (B, A), [A, B] = [B, A], 

(A, A) = A, [A, A] = A, 

(A, (B, C)) = (A, B), ©), (A, [B, C]] = [[A, B], Cl, 
(A, [B, A]) = A, [A, (B, A)] = A. 


It is obvious that the two operations correspond dualistically. This simple 
remark enables us to express an important principle which we shall repeatedly 
apply. 

PrincipLe oF Duauity. To any structure theorem on groups corresponds a 
dual obtained by interchanging the two concepts union and cross-cut. 

In the following we shall consider various substructures of the structure = of 
all subgroups. Two structures 2; and 22 shall be said to be structure isomorphic, 
2, ~ 22 if there exists a one-to-one correspondence between the subgroups of the 
two structures such that if 


A,\@ A:2, B, = Ba, 


then 


(Ai ? B,) e (Ag ’ B:), [Ay ’ B,) 2 [Ae ’ B). 


If A > B are two subgroups of G, we shall denote the index of B in A by 
2? Compare Foundation I, Chapter 1. 
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{A:B}. The two structures 2, and 2: shall be said to be strongly structure 
isomorphic if for any two subgroups A; > B, and the corresponding A; > By, 
we also have the same index 

{A,:B,} = { As: Bs}. 


In this case we shall write >; ~ 22. 


2. Quotient systems.’ In an arbitrary structure = any two elements A = B 
define a substructure consisting of all elements H between A and B 


A2H 2B. 
This structure shall be called the quotient structure of A and B and we shall 
denote it by A = A/B. 

In the case where B is a normal subgroup of A, we ordinarily associate a 
quotient group with the symbol A/B. We shall now show that in the non- 
normal case it is also possible to associate with the quotient A/B an algebraic 
system which in the case of a normal subgroup reduces to the quotient group. 

We represent the group A by means of its left-hand co-sets with respect to B: 


(1) A = {a,B,mB,---} =mB+mBt+-::-, 


where the a; are suitably chosen elements of A. It should be observed that 
it is sufficient to consider only one set of co-sets (1), since the right-hand co-set 
representation may be obtained from (1) by the inverse automorphism a — a” 
in A: 


A = {Ba;", Bay’, --- } = Bay’ + Baz’ +---. 


The totality of left-hand co-sets (1) shall be called the quotient system of A 
with respect to B and we shall write 


(2) A/B = {B,, Br, --: }, 


where the B; = a;B denote the co-sets in some order. This quotient set will 
now be made into an algebraic system through the definition of a multiplication. 
This multiplication differs, however, from the ordinary multiplication in alge- 
braic systems by the property that two elements do not define a unique product 
element, but a product set consisting of several elements. Let B, and Bs be 
two co-sets. The elements of A contained in the complex 


(3) B,-Bs = aa°B-a3-B 
will then give a certain subset of co-sets in (2) 

S= {By, , By, iia }, 
where each B, is of the form 


’ Foundation I, Chapter 3. 
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B, = a,:b-ag-B, 
and b is an element of B. We now define the product of two co-sets as 
(4) B,: Bs; = {B,,, By, , eee }. 


It is natural to extend this definition of the product of co-sets to the product of 
arbitrary subsets 


S = {B,,,--+}, T = {Bs,,---} 


of A/B by saying that S-T is the set of co-sets containing all products B,-B; . 
This multiplication is obviously associative. 

Let us now mention a few of the properties of the new multiplication. The 
product B,-Bs contains the co-set a,-ag-B. In the multiplication, the group 
B plays the rdéle of a right-hand unit element with the properties 


B,:B= B,, B-B,> Ba 


for all co-sets B,. One may define an inverse of a co-set B, as a co-set Bs such 
that 


(5) Bs-B. > B. 

This relation is equivalent to the existence of elements 6, and bz in B such that 
(6) ag-b-da = be. 

This relation may always be satisfied by taking b; equal to the unit element 


e and as = a,'. By taking the inverse of the relation (6), one finds that (5) 
also implies 


B,-Bs > B. 

Hence we have shown 

THEOREM 1. Every element of the quotient system A/B has at least one inverse, 
and if Bg is an inverse of Ba , then B, is an inverse of Bs . 

Let us determine when a product is unique. If a,-B and ag-B have a unique 
product, we must have 

aq°B-ag-B = aq:ag-B, 

and hence for every }; in B there must exist another bz such that 


a,°b;-a3 -_ Aq* 3+ be 


a3 '-by-a3 = be. 
This shows 
TuHEeoreM 2. A product B,-Bgs is unique if and only if Bz belongs to the nor- 
maliser group of B in A. 
Similarly one proves 
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THEOREM 3. The normaliser group of B with respect to A consists of all co-sets 
with a unique inverse. 


3. Multigroups. The properties of a quotient system naturally lead to the 
definition of a new abstract system which may be called a multigroup. It may 
be defined as a system IM of elements satisfying the following axioms: 

I. The product of two elements B, and Bs is a subset of M: 


Ba Bs = {B,,, By, +-* }. 
More generally, the product of two sets of elements 
S = {Ba,, Ba,,°*: }, T = {Bz,, Bs,, - 


is the set defined by all products B,- Bs . 
II. The multiplication is associative. 
III. There exists a unique right-hand unit element B, such that for all B. 


B,-B, = B., BB, > B,. 


IV. To each B, there exists at least one inverse BY, such that 


B,:B, > B, B.-B. > By. 


This definition shows that a multigroup differs from an ordinary group only 
in the property that the product is not unique. Obviously any quotient system 
A/Bisamultigroup. In the case of a multigroup arising from co-sets in a group 
there are further axiomatic conditions satisfied which have not been enumerated 
above. Although these multigroups have several interesting properties, the 
study of these must be reserved for a later occasion. I shall only mention 
here some recent investigations by Marty‘ on the subject. The multigroups 
are closely related to the “Mischgruppen” studied by Loewy.” It seems to me 
that the representation of the quotient systems by means of multigroups is pref- 
erable to the representation by means of ‘“Mischgruppen”, since one avoids 
the double system of operators which the latter theory introduces. 

We shall complete these remarks by proving one theorem which is of impor- 
tance for the sequel. Let A > B be two subgroups of G and A, any sub- 
group of A containing B. The group A; consists of certain co-sets of A/B and 
these co-sets are seen to form a submultigroup of A/B. We shall show con- 
versely that every submultigroup A; of A/B constitutes a subgroup of A con- 
taining B when considered as a set of elementsin A. According to the definition 


4‘ F. Marty, Sur les groupes et hypergroupes attachés a une fonction rationelle, Annales de 
l’Ecole Normale Supérieure, (3), vol. 53 (1936), pp. 83-123. A list of further publications: 
on this subject by the same author is given in the introduction. After the completion of 
this manuscript appeared a paper by H.S. Wall, Hypergroups, Am. Journal of Math., vol. 
59 (1937), pp. 77-98. 

5 A. Loewy, Uber abstrakt definierte Transmutationssysteme oder Mischgruppen, Journal 
fiir Mathematik, vol. 157 (1927), pp. 239-254. See also R. Baer, Sitzungsberichte, Heidel- 
berg, 1928. 
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A, contains the product of any of its co-sets. Hence according to a preceding 
remark it contains the product of any of its elements. To prove that A; con- 
tains the inverse aj,’ of any of its elements a , we recall that A; contains an 
inverse co-set a2-B to a,-B. As in (6), we then find az-bj-a; = be , where b, and 
be belong to B. This shows that a;’ belongs to the co-sets B-a:-B which are 
contained in A;. Hence we have 

THEeorEM 4. The submultigroups of a quotient system A/B when considered 
as a set of elements in A are identical with the subgroups of A containing B. 


4. Concepts in group structures. We shall now define various concepts 
which are of importance for the theory of group structures. We consider as 
before the structure > of all subgroups of a given group G. <A unit quotient in = 
is any quotient 


€ = A/A. 
The product of quotients may be introduced in the following manner: If we have 
4 = A/B, SB = B/C, € = A/C, 
we shall write 
C=AxX 8, ax C=, Cx "=H, 


and say that % is a left-hand and % a right-hand factor of ©. It is often con- 
venient to consider any subgroup A or even the group G itself as a quotient 
with respect to the unit element 


Y= A/E, G = G/E. 
The system of all quotients may be made into a structure, the quotient struc- 
ture of >. When 
%, = Ai/Bi, W%, = Ao/B:, 
we define 
(M1, Me) = (Ar, A2)/(Bi, Bs), (2%, , M2] = [Ai, Ae]/[Bi , Be]. 


We shall apply these concepts mainly in the case where %, and %: have the same 
denominator 


(7) 4%, = A,/B, YW. = Ao/B, 
and hence 
(8) (Wi, Me) = (Ar, As)/B, (M1, We] = [Ai , Ae]/B. 


The two quotients %{, and %2 will be said to be relatively prime when their cross- 
cut is a unit quotient, i.e., when 


(A,, Az) = B. 


® Foundations I, Chapter 3. 
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We may also apply the principle of duality and define the duals of these 
concepts. Corresponding to (7) we consider quotients with the same numerator 


(9) B, = A/B,, BS. = A/B,. 
We define their left-hand union and cross-cut respectively as 
([B: , Be]. = A/(B,, Bs), (Bi, Be): = A/[Bi , Ba). 


The two quotients (5) are |.h. relatively prime if their ].h. cross-cut is a unit 
quotient, i.e., if 


[B, , Bs] = A. 
Next it is convenient to introduce the notion of transformation. Let 
(10) a = A/B; € =C/B 
be two quotients with the same denominator. The quotient 
(11) WH = (A, CG] X C* = CAC” = [A, C//C 


is then said to have been obtained from & through (right-hand) transformation 
with ©. 

The transformation has a series of simple properties which we shall now 
enumerate. The proofs follow directly from the definition (11). 


Lema 1. [a, G] = CAC" x CE. 

LemMa 2. (BX OAB x ©)* = B(CAC)B". 
Lemma 3. (A, BIC" = [(CAC*, CBC”). 
Lemma 4. (C X BA X B(C K B)* = CAC". 
Lema 5. C(M X B)C* = GAG" xk CBC", 


where 
€, = BEB”. 


This rule for the transformation of a product may be extended to an arbitrary 
number of factors. 


Lemma 6. = [2% X A, Bi X BI/[M, B) = [MM We’, B. Bi Bz"), 
where 
% = ABA", B. = BAB". 
The following theorem is of considerable importance. 
a 5. Ifa product 8B X € hasar.h. factor A, then B has the r.h. factor 


Proof. Since 8 X € has the two factors U% and GC, we can write 


Sx € = BS, X [M, G), 
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and division by € gives 
B= 3, xX CAC”. 


Dually corresponding to right-hand transformation we also have left-hand 
transformation, which may be defined as follows. Let 


(12) w= A/B, D = A/D 
be two quotients with the same numerator. The quotient 
(13) 4” = D* X (A, D}. = DAD = D/(B, D) 


is then the left-hand transform of % by D. 

In the special case where the quotients % and € in (10) are relatively prime, 
i.e., [A, C] = B, we shall say that YY’ in (11) has been obtained from % by a 
r.h. similarity transformation. In the same manner %”’ in (13) is obtained from 
YW in (12) by a L.h. similarity transformation if A and D are |.h. relatively prime. 
The |.h. and r.h. similarity transformations are inverse in the sense that the I.h. 
similarity transformation of YW’ in (11) by [A, C]/A gives A, while the r.h. similar- 
ity transformation of WX’ in (13) by B/(B, D) also gives . 

Let us finally mention 

THEOREM 6. Any right-hand union of two r.h. relatively prime quotients with 
the same denominator 


M = (A, Bl, (A,B) =€ 
is also the Lh. union 
M = (A’, VB’), (U,V), = © 
of two L.h. relatively prime quotients 
w = BAB", W = ABA" 


obtained from A and B by similarity transformation. 

The proof is immediate. The dual of Theorem 6 is obviously also true. 
Furthermore it may be extended to the union of an arbitrary number of quotients 
each relatively prime to the union of the rest. 


Chapter 2. The law of isomorphism 


1. Correspondences. A fundamental theorem in group theory is the ordinary 
law of tsomorphism: 

Let A and B be two subgroups such that A is normal in [A, B]. Then (A, B) is 
normal in B and there exists an (element) isomorphism between the two quotient 
groups 


[A, B]/A = B/(A, B). 


We shall now consider what remains of this law when we drop the condition 
of normality and consider two arbitrary subgroups A and B of G. We denote 
their union and cross-cut by 
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M = [A, B), D = (A, B), 
and we wish to compare the two quotient systems 
Y= M/A, B= B/D. 
The quotient system % is made up by certain co-sets 
(1) B= {---, dD, --- }. 


Those co-sets of % which have the same multipliers as in (1) we shall call the 
co-sets of A corresponding to B. They form a subset 


(2) WM = f--- BA, ++] 


of A. These co-sets are all different, and hence we have the well-known 
THEOREM 1. For any two groups A and B the index relation 


{{[A, B]:A} = {B:(A, B)} 


holds. 

Obviously the corresponding co-sets (2) form a generating system for the 
quotient system % in the sense that one obtains & by taking all finite products 
of them. Let us now determine when they actually constitute the whole group 
[A, B]. In this case each element of [A, B] must belong to some co-set b- A and 
hence each product a,-b; also has a representation b-a. We shall say that two 
groups A and B are permutable if 


[A, B] = A-B = B.-A, 


i.e., if to any element a; in A and }, in B there exist other elements such that 
a,°b; = be-a2. If A and B are permutable, there exists a one-to-one correspond- 
ence b-A = b-D between &% and B. We shall call this the regular co-set cor- 
respondence and write A= B. 

THEOREM 2. The necessary and sufficient condition that there exist a regular 
correspondence between the quotients 


4 = [A, B)/A, B = B/(A, B) 


is that A and B be permutable. 

A consequence is 

THEOREM 3. Let the index {B:(A, B)} be finite. The necessary and sufficient 
condition that 


(4) ([A, B):A} = {B:(A, B)} 


is that A and B be permutable. 

One may now ask when there exists a structure isomorphism between the two 
quotient systems &% and B. One can always obtain a correspondence between 
the subgroups of & and % in the following manner: Let us denote by A and B 
arbitrary subgroups in & and B 
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(5) [A, B] >A 


IV 


A, B > B = (A,B). 
The correspondence 
(6) A — (A, B), B — [A, B] 


will then be called the regular structure correspondence between A and B. We 
can then prove 

TuHeoreM 4. The necessary and sufficient condition for the regular structure 
correspondence to establish a structure isomorphism 


(7) [A, B|/A ~ B/(A, B) 
is that for every A and B defined by (5) we have 
(8) A = [A, (B, A)], B = (B, [A, B}). 


Proof.’ The relations (8) are obviously necessary and sufficient in order 
that the regular structure correspondence give a one-to-one correspondence 
between the subgroups of % and B. Furthermore, when they are satisfied we 
find for B, and B, with the images A; and A, 


(B,, B,] —- [A, B, ’ B,) = [A., Aj], 
(B, , B:) = (B, Ai, As) — [A, (B, A1, A2)] = (Ai, Aa). 


The conditions (8) are seen to hold if A has the following property: For any 
C and Din M = [A, B| we have 


(C, [A, D)) - [D, (A, C)), when C D, 
(C, [A, D]) = [A, (C, D)], when C 2 A. 


In this case we shall say that A is structure normal in M. 

Let us finally suppose that A and B are permutable so that the regular co-set 
correspondence gives a one-to-one correspondence between the co-sets of M& and 
%. We shall now make the stronger assumption that to every ain A and bin B 
there exists an exponent n,,, such that 


(10) a-b = b"*.a’, 


where a’ belongs to A. This condition is equivalent to saying that A shall 
be permutable with all subgroups of B. 

We can then prove 

THEeoreM 5. Let the group A be permutable with all subgroups of B. Then 
there exists a strong structure isomorphism 


(11) [A, B)/A = B/(A, B), 


and the regular co-set correspondence between the two quotients also gives the regular 
structure correspondence. 


IV 


(9) 


7 This result has been derived for any structure in the paper by O. Ore, On the theorem of 
Jordan-Hiélder, Transactions Amer. Math. Soc., vol. 41 (1937), pp. 266-275. 
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Proof. Any subgroup B is made up by co-sets b: D, b:D, ---, and to these 
correspond the co-sets b; A, bp A,---. These co-sets must again form a group, 
since all elements in a product 6;A-b2A are containetl in co-sets of the form 
b,b?-A = bs3-A. Hence we have B — [A,, B}, and it is seen immediately that 
the first relation (8) is satisfied. In the same manner any group A is made 
up by co-sets bA where the b form a group B = (A, B) and also the second rela- 
tion (8) is satisfied. Hence we have shown that the regular co-set correspond- 
ence gives a structure isomorphism identical with (5). The fact that (11) is a 
strong structure isomorphism follows from Theorem 2 or 3, since A is permutable 
with every B. 

It is also easily seen that in this case A is structure normal in [A, B]. 


2. Permutable groups. The preceding results indicate the necessity of a 
study of the permutable groups.* We shall now give some of their simplest 
properties. 

THEOREM 6. If A and B are permutable groups, then 

(C, [A, B)) = [A, (C, B)], C 


(D, [A, B}) = [B, (A, D)], D 


IV 


A, 


12) B. 


IV 


Proof. To prove, for instance, the first of these relations, we observe that the 
right-hand side is always contained in the left-hand side for any groups. If now 
A and B are permutable, every element of [A, B] has the form a-b. If this 
element is to be at the same time an element c of C, we must have c = a-b, 
where obviously b belongs to C, since C 2 A. 

Furthermore let us mention 

Lemma 1. If A is permutable with B and C, then A is permutable with [B, C). 

Lemma 2. A group is permutable with all its subgroups. 

Lemma 3. If B = B’ and B’ is permutable with A, then B’ is permutable 
with (A, B). 

Proof. If d is an element of (A, B), then b’-d = c-b,, where c belongs to A 
and also to B. 

Lemma 4. If A = A’ and B 2 B' and A’ and B’ are permutable, (A’, B) and 
(A, B’) are permutable. 

The proof follows by two applications of the preceding lemma. 

TuHEeorEM 7. Let A = A’ and B = B’. Furthermore let A’ be permutable 
with (A, B) and (A, B’), while B’ is permutable with (B, A) and (B, A’). Then 
there exists a one-to-one correspondence 


(13) & = [A’, (A, B)]/[A’, (A, B’)] = [B’, (A, B)]/[B’, (B, A] = B- 
between the co-sets of these two quotients. 


8 Various properties of permutable groups have been given by E. Maillet, Sur les groupes 
échangeables et les groupes décomposables, Bulletin de la Société Math. de France, vol. 28 
(1900), pp. 7-16. 








160 OYSTEIN ORE 


Proof. We write 
M = [4’, (A, B’)], N = (A, B). 


From Lemmas | and 2 it follows that M and N are permutable, and from The- 
orem 6 we obtain 


[M, N] = [A’, (A, B’)], 

(M, N) = (A, B, [A’, (A, B)]) = [C, B’), (A’, B)]. 
The application of Theorem 2 gives us the correspondence 
(14) A = (A, B)/((A, B’), (B, A’), 


and the same correspondence is found for B. 

Permutability of two groups is not a self-dual concept, since the structural 
relations (12) do not imply their duals. There exists, however, a theorem which 
may take the place of the dual of Theorem 7: 

TueorEeM 8. Let A = A’ and B 2 B' and let A and B both be permutable 
with [A’, B’). Then there exists a correspondence 


(15) W, = (A, [B, A’])/(A, [B’, A’]) = (B, [A, B’])/(B, [A’, BY) = Bs. 
Proof. In this case we write 
M, = (A, [B, A’), N, = [A’, B’, 
and M, and N;, are permutable according to Lemma 3. Theorem 6 gives 
(M,, Ni) = (A, [B’, A’), 
[M, , Ni] = [A’, B’, (A, [B, A’})] = ([B, A’), [A, B’D), 
and from Theorem 2 one obtains 
(16) YW, = ((B, A’), [A, B’)])/[A’, BI. 


TueoreM 9. Let A = A’ and B = B’, where A’ is permutable with B and B’ 
and B’ is permutable with A and A’. Then the correspondences (13) and (15) hold, 
and we have A = A, and B = B,. 

Proof. It follows from Lemmas 1, 2 and 3 that in this case the conditions for 
Theorems 7 and 8 are satisfied. The equality of the quotients is obtained from 
Theorem 6. From the correspondences (14) and (16) we deduce the following 
self-dual relation: 

THEOREM 10. Let A = A’ and B = B’, where A is permutable with B and B’, 
while B’ is permutable with A and A’. Then 


(17) (A, B)/(A, B, [A’, B’]) = [A’, BY, (A, B)/1A’, B’. 


One finds, namely, from Theorem 6, 
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(A, B’), (B, A’)] = (A, B, [A’, BY), 
({A, B’, [B, A’)) _ [A’, B’, (A, B)}. 


There are various other interesting problems connected with permutable 
groups. One of the main problems is the determination of all permutable 
groups. It may be formulated as follows. Let A and B be given groups. Find 
all groups M = [A, B], where A and B are permutable groups isomorphic to A 
and B respectively. 

Another problem is the relation of the permutable groups to their structural 
properties. This problem may be formulated as a converse of Theorem 6: 
when do the relations (12) imply that A and B are permutable groups? 

We shall next determine when the correspondences we have derived may be 
replaced by strong structure isomorphisms. According to Theorem 5 we shall 
then have to consider groups which are permutable with all subgroups of other 
groups. For such groups we have 

Lemma 5. If B and C are permutable with all subgroups of A, then |B, C] has 
the same property. 

Lemma 6. If A = A’, then A is permutable with all subgroups of A’. 

Lemma7. If A = A’ and B is permutable with all subgroups of A’, then (A, B) 
has the same property. 

Proof. Leta’ be an element in A’ and d an element in (A, B). Thend-a’ = 
a’"-b, where b belongs both to A and B. 

From these remarks we obtain 

THEOREM 11. Let A = A’ and B = B’, where A’ and B’ are permutable with 
all subgroups of (A, B). Then 


[A’, (A, B)]/[A’, (A, B’)] = [B’, (A, B)]/[B’, (B, A’). 


Proof. In this case the conditions of Theorem 7 are satisfied. Furthermore, 
M is permutable with all subgroups of N according to Lemmas 5 and 7. Hence 
we conclude from Theorem 5 the strong structure isomorphism 


i (A, B)/((A, B’), (B, A’)), 


and similarly for 8. Corresponding to Theorem 8 we have 
TuHEeorEM 12. Let A 2 A’ and B 2 B’, where |A’, B’| is permutable with 
every subgroup of Aand B. Then 


(A, [B, A’])/(A, [B’, A’]) = (B, [A, B’])/(B, [A’, B’). 


Corresponding to Theorem 10 we have 
THEOREM 13. Let A = A’ and B 2 B’, where A’ and B’ are permutable with 
every subgroup of A and B. Then 


(A, B)/(A, B, [A’, B’]) = [4’, BY, (A, B)I/[A’, B’. 


Let us observe finally that if A and B are groups such that A is permutable 
with every subgroup of B and B permutable with every subgroup of A, the rela- 
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tions (12) obviously must hold. In this case it is seen, however, that also the 
dual relations 


[C, (A, B)] = (A, [B, C)), C 
[D, (A, B)] = (B, [A, D)), D 


IIA 


A, 
B 


(18) 


IIA 


are fulfilled. Again the interesting problem arises whether the existence of the 
relations (12) and (18) is sufficient to conclude that A is permutable with every 
subgroup of B and B permutes with every subgroup of A. 


3. Quasi-normal subgroups. We shall now introduce a new concept. 

A subgroup A of G is said to be quasi-normal when it is permutable with every 
subgroup of G. 

This condition may also be stated as follows. For each g in G and a in A 
there exists an exponent n,,, such that 


(19) a-g=g"**-a’. 


The quasi-normal subgroups obviously generalize the ordinary normal sub- 
groups. In that case we have n., = 1 for all a and g. The condition (19) 
may also be stated in the more symmetric form 
a-g" =g" -a’. 

The quasi-normal subgroups have several properties in common with ordinary 
normal subgroups. Let us mention first that from Theorem 6 follows 

THEorREM 14. Jf A is a quasi-normal subgroup and B and C arbitrary sub- 
groups, then 


(B, [A, C]) = [C, (A, B)] when B = C, 
(B, [A, C]) = [A, (B, C)] when B = A. 
It is obvious that when A and B are quasi-normal in G, [A, B] has the same 


property. 

TuHEorEM 15. If A is quasi-normal in G and B is any subgroup of G, then 
(A, B) is quasi-normal in B. 

Proof. Let d be any element of (A, B) and b any element of B. Then d-b = 
b”-a, where a must belong both to A and B. From Theorem 5 follows immedi- 
ately 

THEOREM 16. When A is quasi-normal in [A, B), then (A, B) is quasi-normal 
in B, and we have the strong structure isomorphism 


(20) [A, B])/A ~ B/(A, B). 


To prove the next theorem we shall need 
Lemma 8. Let A be quasi-normal in G. When g is any element in G and ao 
some given element in A, we can always write 


g” = (gao)”-a. 
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Proof. We write g = gao-a)' and apply (19) to the various factors in the 
product g” . 

TuEeoreM 17. In the structure isomorphism (20) any quasi-normal group A in 
[A, B] containing A corresponds to a quasi-normal group B in B containing (A, B), 
and conversely. 

Proof. The first part of the theorem follows from Theorem 15, since B = 
(A, B). To prove the converse we consider the relation 


(21) b-b = b"-b,, 


which holds for all elements in B and B. The elements of [A, B] have the form 
g = b-a, while the elements of A are of the form a-b. By means of Lemma 8 
we obtain from (21) b-g = g"-d,, and when this relation is multiplied on the 
left by an element in A we obtain d-g = g”-d: for arbitrary d in A. 

Theorem 17 gives us the ordinary lemma that if A and B are maximal quasi- 
normal groups in G, then (A, B) is maximal quasi-normal in A and B. 

THEOREM 18. When A’ is quasi-normal in A and B’ quasi-normal in B, then 
[A’, (A, B’)] is quasi-normal in [A’, (A, B)] and similarly |B’, (A’, B)] is quasi- 
normal in [B’, (A, B)], and there exists the strong structure isomorphism 


(22) [A’, (A, B)}/[A’, (A, B’)] = [B’, (A, B))/[B’, (B, A’). 


Proof. The isomorphism (22) is a consequence of Theorem 11. Letl = d-a’ 
be an element of [(A, B), A’] and l’ = d’-a’ be an element in [(A, B’), A’]. 
Since (A, B’) is quasi-normal in (A, B), we find 


I'-1 = d’-a,-d-a’ = d’-d"-a, = d™-l;. 


According to Lemma 8 we can write d” = [*-a;, so that we finally obtain 
V-l= Tk. 

Theorem 18 represents the complete analogue of the lemma of Zassenhaus for 
ordinary normal subgroups. 

Let A’ be normal in A and B’ normal in B. Then [A’, (A, B’)] is normal in 
[A’, (A, B)] and [B’, (A’, B)] is normal in [B’, (A, B)], and there exists the ordinary 
isomorphism between the quotient groups 


[A’, (A, B)]/[A’, (A, B’)] = [B’, (A, B)]/(B’, (A’, BY). 


4. Quasi-normal subgroups of the symmetric and alternating groups. In 
connection with the quasi-normal groups one may construct groups which in 
many ways are analogous to those ordinarily defined in connection with normal 
subgroups. Among these we shall mention the quasi-center. Let c be an ele- 
ment of G with the property that the cyclic group it generates is permutable 
with all subgroups of G. This is equivalent to saying that for any element 
g of G and any pair of exponents n and m we have c"-g” = g” -c".. The group 
generated by these elements c we shall call the quasi-center of G. The quasi- 
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center contains the ordinary center and also the nucleus introduced by Baer.’ 
From the definition of the quasi-center follows 

THEOREM 19. The quasi-center is a characteristic subgroup. 

A simple consideration shows that the symmetric group >, has a quasi-center 
C,, equal to the unit element except in the case n = 2, where YS, = C2, and n = 8, 
where C; = A; is the alternating group. The quasi-center of the alternating 
group is the unit element except for n = 3. 

It may be of some interest to determine the quasi-normal subgroups of the 
symmetric group ~, and the alternating group A,,._ The case of the symmetric 
group may be solved directly by the following 

Lemma 9. Let H be quasi-normal in G. If h is an element of H and t an ele- 
ment of order 2 in G, then H contains the transform tht’ and the commutator 
(t, h) = tht*h”. 

Proof. Since we can suppose that ¢t does not belong to H, we have th = h’-t. 
Since every substitution is the product of transpositions this implies 

THEOREM 20. Any quasi-normal subgroup of the symmetric group is normal. 

The proof of the theorem that the alternating group contains only the trivial 
quasi-normal groups A, and E is necessarily more complicated, since it implies 
the fact A, is simple. 

We first show 

Lemma 10. If a quasi-normal subgroup H of A,, contains a cycle with three 
elements, then H = A,. 

Proof. We may suppose n 2 4. If H contains the cycle h = (1, 2, 3), ae- 
cording to Lemma 9 it contains 


aha’ = (a, 3, 2), a = (1, a)(2, 3), a > 3, 


and hence H contains all cycles of order 3 with two fixed elements. We shall 
also need 

Lemma 11. Let n > 4 and let H be a quasi-normal subgroup of A,. If H 
contains a substitution which is the product of two transpositions without common 
elements, then H = A,,. 

Proof. Leth = (1, 2)(3, 4) belong to H. The commutator 


(a, h) = (3, 5, 4), a = (1, 2)(3, 5) 


then also belongs to H according to Lemma 9. 

We shall now proceed to the actual determination of the quasi-normal sub- 
groups of A,. We suppose n > 4 and write the substitutions of H as the 
product of cycles without common letters. We consider the cases 

I. H contains a substitution h = (1, 2, --- , a) with at least one cycle with 
a > 4 letters. We put a = (1, 3)(2, 4) and find the commutator (a, h) = 
(1, 3, 5). 


® R. Baer, Der Kern, eine charakteristische Untergruppe, Compositio Mathematica, vol. 1 
(1934), pp. 254-283. 
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II. H contains a substitution h = (1, 2, 3, 4) --- with at least one cycle of 
order 4. Then we put a = (1, 4)(2, 3) and find (a, h) = (1, 3)(2, 4). 

III. H contains substitutions h with a cycle of order 3. If h contains only 
one such cycle while the other cycles are transpositions, then h’ is a cycle of 
order 3. Hence we may suppose h = (1, 2, 3)(4, 5, 6) --- and for a = (1, 4) 
(2, 5), we find (a, h) = (1, 4)(3, 6). 

IV. All substitutions in H are the product of transpositions. According t« 
Lemma 11 we can suppose that h contains at least three transpositions h = 
(1, 2)(3, 4)(5, 6) --- and for a = (1, 4)(2, 5) we find (a, h) = (1, 6, 3)(2, 4, 5) 
against assumption. 

THEOREM 21. The alternating group contains no quasi-normal subgroup for 
n # 4. 

For n = 4 we have the well-known exception. 


- 


Chapter 3. Extensions of the theorem of Jordan-Hélder 


1. Refinements of chains. One of the main applications of the preceding 
theory is to obtain extensions of the theorem of Jordan-Hélder and its general- 
ization by Schreier-Zassenhaus.” 

In the following we shall consider two-fixed groups A > B in the given group 
G. Let 


(1) A 
(2) A 


B, > B,>--- > B, = B, 
O>Ci>-:-:->C.=B 


be two chains of arbitrary subgroups between A and B. We shall denote these 
chains by {B;} and {C;} respectively. The existence of the chains (1) and (2) 
may also be considered as a factorization of the corresponding quotient Y% = 
A/B in the sense defined in Chapter 1, 


(3) A= BxXBwxX---KB=|- Gx GX:::x G, 
where 
(4) B; = By4/B; P GC; = Cj3/C; . 


Any new chains obtained from (1) and (2) by intercalating new terms will 
be called a refinement of the given chains. The two refinements consisting 
both of r-s subgroups 


(5) B;,; = (Bi, (C;, B)], Cyr = [Cx , (Br, Crx)] 


we shall call the (lower) refinement of {B;} with respect to {C;| and of {C;} with 
respect to {B;}, respectively. It may also be considered as a further fac- 
torization of (3): ; 

1°Q. Schreier, Uber den Jordan-Hélderschen Satz, Abh. Math. Sem. Hamburg, vol. 6 


(1928), pp. 300-302; H. Zassenhaus, Zum Satz von Jordan-Hélder-Schreier, Abh. Math. Sem. 
Hamburg, vol. 10 (1934), pp. 106-108. 
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(6) B= Bia X--- X Bis, GC; = Gia X +++ K Ge, 
where 
Bi; = [Bi, (Cia, Bia)|/([B:, (C;, Bia), 
Cre = (Ce, (Cra, Bradl/(Ce, (Cha, Bd). 
There also exists a dual refinement to (5): 
(8) B;.; = (B,_,, [B;, C,)), Cia = (Cx, (Bi, Cral), 


which we shall call the upper refinement of {B;} with respect to {C;} and of 
{C;} with respect to {B;}. 
Obviously we always have 


(7) 


B;.; = B,.;, Cra 2 Ce.t- 
Corresponding to (6) we have the factorizations 
(9) Bi = Bia X= X Bis, Cj = Ca X +++ X Ce, 
where 


Bi; - (Buu, [B; , Cjal)/(Bia ’ [B; , Cj), 


(10) : 
Cer = (Cra, (Ce, Bral)/(Cia, (Ce , Bil). 
2. Cross-cut permutable chains. We shall now say that two chains (1) and 
(2) are cross-cut permutable if they have the following property: 
For each i the group B; shall be permutable with all groups 


(11) (Bin, Cj) (j= 0, 1, nek , 8), 
and for each j the group C; shall be permutable with 
(12) (Cj, B;) (z = 0,1,--- ,r). 


Let us observe that cross-cut permutability refers only to certain properties 
of a group B; with respect to certain subgroups of the preceding B;. , and 
similarly for C; with respect to Cj. . 

The main theorem on cross-cut permutable chains will now be proved. 

TuHEeoreM 1. Let {B;} and {C;} be two arbitrary cross-cut permutable chains 
connecting two groups A and B. These chains can then be refined into new cross- 
cut permutable chains with quotients corresponding in such a manner that for cor- 
responding quotients there is a one-to-one correspondence of their co-sets. 

Proof. We refine the two chains by the lower refinements (5). Both the 
refined chains contain 7-s terms, and it follows from Theorem 7, Chapter 2, 
that B;,; 2 C;,;. 

The proof of the fact that the new chains are cross-cut permutable is more 
complicated. It may be done directly by calculating the groups corresponding 
to (11) for the new chains. We shall, however, apply another method which 
yields more information about cross-cut permutable chains. We first deduce 
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Lemma 1. The chains {B;,;} and {C;} are cross-cut permutable. 
Proof. We shall have to show that B;,; is permutable with every (B,, 1 , Cx). 
Since B;_, contains B;, ;; we can write 


(13) (Biya, Ce) = Dijin = (Ce, Bin, (Bi, (Cin, Bia))). 

If now k 2 j, then (C; , B;4) is contained in (C;_; , By), and we find 

(14) Dijin = (Ce, Bia) (k 2 j). 
When k < j — 1, we apply Theorem 6, Chapter 2, to (13) and obtain 

(15) Dijin = (Cin, Bu), (Bi, Cx)) (k Sj - 1). 


To show that B;,; is permutable with D;,;x , it is sufficient to show according 
to Lemma 1, Chapter 2, that B; and B;_;,; separately are permutable with D;_; x . 
It is obvious from the given conditions that B; is permutable both with (14) 
and (15). Furthermore, (B;_; , C;) is permutable with (14) according to Lemma 
2 and with (15) according to Lemmas 2 and 3, Chapter 2. 

Finally one proves in a similar manner that C; is permutable with the cross- 
cuts Dijin => (B;; ’ Cy-1). 

Lemma 2. The chain {B;,;} is unchanged when refined with respect to {C;}. 

Proof. The terms of the second refinement are B;,;, = [B;,;, (Cx, Bi,j-1)). 
When the expressions (14) and (15) are substituted here, one obtains respec- 
F tively 





Bi jx sat [B; , (Bia, Cj), (Bu, C;)]) = [By ; (Bia : C;)) = By ;, 
Bijx = (Bi, (Bin, Ci), (Cian, Bir), (Bi, Ci)) = Bisa. 
Lemma 3. The refinement of {C;} with respect to {B;} is equal to its refinement 
with respect to | B;,;}. 
Proof. We find C;,;,. = (Cx, (Cx, Bi,;)], and again the substitution of (14) 
and (15) gives 


aa 
Ca re arene: 


Ciin = Cri (k 2 )) 
\ Ci ik = Ci: (k 3S j = 1). 


From this lemma, together with Lemma 1, it follows that the chains { B;,;} and 
{Cx,.} are cross-cut permutable and the proof of Theorem 1 is completed. 

The preceding results also show that repeated refinements of cross-cut per- 
mutable chains give no new chains. 

Theorem | is obviously a generalization of the theorem of Schreier-Zassen- 
haus. Through specialization one can also obtain extensions of the theorem of 
Jordan-Hélder. We shall say that {B;} and {C;} are maximal cross-cut per- 
mutable chains when there exists no group between B;, and B; which is per- 
mutable with all cross-cuts (11) and similarly for C;.. and C;. We then have 

THeorEeM 2. If there exist two maximal cross-cut permutable chains between 
the two groups A and B, both chains have the same number of terms and the quotients 
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of the chains correspond tn such a manner that for corresponding quotients there is a 
one-to-one correspondence between their co-sets. 

When the index of B in A is finite, Theorem 2 shows that the indices of the 
two maximal cross-cut permutable chains are the same in some order. 

There exists for this theory a dual theory which one obtains by considering 
the upper refinements (8). We shall say that two chains {B;} and {C;} are 
union permutable when they have the following property: 

For all i the group B,« is permutable with 


[B; , Cj] (j = 0, I, +++, 8), 
and for all j Cj. is permutable with 
IC; , Bi (¢ = 0,1, ---,7r). 


For such union permutable chains all the theorems derived for cross-cut 
permutable chains will hold when the lower refinements {B,,;} and {C,,.} are 
replaced by the upper refinements {B;,;} and {Cj,.}. In Theorem 2 we must 
replace maximal chains by minimal chains. 


3. Permutable chains. We have already observed that Theorems 1 and 2 
correspond to the theorems of Schreier-Zassenhaus and Jordan-Hélder. More 
specifically they correspond to the case of composition series where one considers 
chains in which every term is a normal subgroup of the preceding, because 
cross-cut permutability refers only to properties of a group B; with respect to 
certain subgroups of the preceding B,;.. Let us now show how one can obtain 
analogues of the theorems on principal series where one deals with normal sub- 
groups of the full group A. 

We shall say that the two chains {B;} and {C;} are permutable if every B; 
is permutable with every C;. We can then prove 

THEeoreEM 3. Let {B;} and {C;} be two permutable chains connecting A and B. 
These chains may be refined into new permutable chains with their quotients cor- 
responding in such a manner that for corresponding quotients there is a one-to-one 
correspondence between their co-sets. 

Proof. It follows from Theorem 9, Chapter 2, that in this case the upper 
and the lower refinements (5) and (8) are identical. Furthermore, we have 
B;,; = €;,. The fact that the refined chains are again permutable may be 
derived from Lemmas 1 to 4 in Chapter 2. 

Through specialization of Theorem 3 we can again obtain an analogue to the 
Jordan-Hélder theorem. We shall say that two permutable chains are maximal 
when no further term may be intercalated such that the resulting chains are 
still permutable. 

THEOREM 4. Any two maximal permutable chains between two groups A > B 
will have the same number of terms and their quotients correspond in such a manner 
that for corresponding quotients there exists a one-to-one correspondence between 
their co-sets. 
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For finite groups this theorem, in the slightly weaker form when the indices 
of the two chains are the same in some order, is due to Maillet." 


4. Quasi-normal chains. The next natural step is to seek structure iso- 
morphism for the corresponding quotients in the two chains. We shall say that 
a chain is guasi-normal when each term is quasi-normal in the preceding. The 
main theorem is then 

THEOREM 5. Let |B;} and {C;} be two quasi-normal chains connecting A and 
B. These chains may be refined into new quasi-normal chains in such a way that 
there exists a correspondence between their quotients, with corresponding quotients 
having strongly structure isomorphic quotient systems. 

Proof. We apply the lower refinements (5) and all statements of Theorem 5 
are consequences of Theorem 18, Chapter 2. 

Theorem 5 is a complete analogue of the theorem of Schreier-Zassenhaus. 
The analogue of the theorem of Jordan-Hélder may be expressed as follows: 

THEOREM 6. If there exist two maximal quasi-normal chains between A and B, 
both contain the same number of terms and the quotient systems are strongly structure 
isomorphic in some order. 

A maximal quasi-normal chain is of course a quasi-normal chain in which 
no further quasi-normal terms may be intercalated. 

One might finally want to obtain an analogue to the theorem on principal 
chains by considering chains in which every term is quasi-normal in the full 
group A. This is, however, the only point at which our theory differs from the 
ordinary normal theory. In this case we cannot prove that the refined chains 
are again formed by quasi-normal groups in A. This is due to the fact that the 
quasi-normal groups in A°do not form a structure, since the cross-cut of two 
quasi-normal groups need not be quasi-normal in A. 


Chapter 4. Normal subgroups 


1. Similarity. Let us now turn to the properties of normal subgroups. The 
normal subgroups of a group G obviously form a structure. Since any two 
normal subgroups are permutable we also have: 

Any three normal subgroups A = B and C satisfy the Dedekind axiom 


A structure satisfying the Dedekind axiom I have called a Dedekind structure.” 
The general theory of Dedekind structures may now be applied to the normal 
subgroups. Since the Dedekind axiom (1) is found to be a self-dual condition, 


1 FE. Maillet, Sur de nouvelles analogies entre la théorie des groupes de substitutions et celles 
des groupes finis continus de transformations de Lie, premiétre note: Sur des suites remargu- 
ables de sous-groupes d’un groupe de substitutions, Journal de Math., (5), vol. 7 (1901). 
pp. 13-82. 

12 Foundations I, Chapter 1. 
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our principle of duality holds to its full extent also for normal subgroups. One 
may generalize the theory of normal subgroups in the ordinary way by consider- 
ing groups with operators. An operator 7 in G has the property of making 
correspond to each element a an element a’ such that 

(a-b)” = a’-b’. 
A subgroup A is invariant with respect to T if 

A’ SA. 

All groups which are invariant under a certain system of operators 

Z=T7,,7:,°-° 


also form a structure, and this structure is a Dedekind structure in case the 
operators include the set of all inner automorphisms of G. 

In the structure of normal subgroups we introduce quotients and these are 
as usual associated with a quotient group. Such quotients can be transformed 
according to the rules given in Chapter 1. Let 


(2) A= A/B, € = C/B 


be two quotients with the same denominator B, whereby we assume as always 

in the following that the subgroups are considered as normal in G. We suppose 
that Wand € are relatively prime, i.e., B = (A,C). According to the ordinary 
aw of isomorphism the right-hand similarity transformation 


(3) CAC = [A, C)/C 


then gives a new quotient group which is isomorphic to & in the usual sense. 
In the more general case where & and € have a greatest common factor 2, , 


Y= % XD, C=G@xDd, 
we have according to Lemma 4, Chapter 1, 
CAC’ = GAG", 
so that the transform is isomorphic to the left-hand factor %, of 2%. 
In the same manner, if 
(4) 4 = A/B, D> = A/D, A = [B, D) 
are two left-hand relatively prime quotients with the same numerator, the left- 


hand similarity transformation 


(5) DAD = D/(B, D) 


gives a quotient group isomorphic with M%. Again if % and D in (4) have a 
common I.h. factor, the transform (5) is isomorphic to a r.h. factor of Y. 

Two quotients % and % shall be said to be similar if one can be obtained from 
the other through a series of r.h. and |.h. similarity transformations. Similar 
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quotients are isomorphic. Similarity is a special form of isomorphism and it 
follows from the general theory of Dedekind structures that it is the form of iso- 
morphism which occurs in the formulation of the decomposition theorems for 
algebraic systems. 

We shall say that when 4% and € are r.h. relatively prime quotients with the 
same denominator defined by (2), then 


(6) M = (A, C] 


is the direct union of A and ©. This corresponds of course to the ordinary 
direct product. In the same manner one defines the left-hand direct union. 
It follows from Theorem 6, Chapter 1, that to any r.h. direct union (6) cor- 
responds a i.h. direct union I = [Y%’, C’],, where WA’ and C’ are similar to A and 
©. This result may be extended to an arbitrary number of quotients. 


2. Direct similarity. We shall now study a special type of similarity. Let 
% and B be quotients with the same denominator. If there exists a third quo- 


tient © with the same denominator and relatively prime to both M% and B such 
that 


(7) § = [%, © = [%, ©), (a, ©) = (8, GF =G, 
we shall say that % and % are directly similar in . The relation (7) may also 
be written 

CAT = CVBE*. 
When & and 8% are directly similar in some §, they are also directly similar in 
[M, B]. One obtains therefore by taking the cross-cut of 5 with [W, B] 
(8) [M, G] = [B, G] = (%, B), (%, G@) = (8, G) =G, 
where, according to the Dedekind axiom, 

Gi - (G, (2, ¥)). 

The isomorphism between % and % defined by (8) is such that to any a in A 

there corresponds a unique b in 8 with the property that 
(9) acy = b-ce, 


where ¢, and cz belong to G,. The direct decompositions (8) show, however, 
that the elements of € are permutable with the elements of both Y% and B; hence 
€ belongs to the center of [A,B]. The correspondence (9) may then be written 
a = b-c where c belongs to the center, i.e., we have a central isomorphism between 
2% and % in the ordinary sense in which this term is used in group theory. 

THEeoreM 1. If two quotients A and % are directly similar, they are centrally 
isomorphic in the union [A, B]. 

The following theorem is also a simple consequence of the preceding remarks: 
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THEOREM 2. Let A, B and © be quotient groups with the same denominator. 
The relations 


(10) =M = (A, B) = (a, C) = [B, G], (A, ©) = (8, © = G 
imply that © is an Abelian group belonging to the center of M. The relations 
(11) M = (A, C| = [B, C], (A,B) = (8B, © = (A, © = G 


imply that AX and B are Abelian groups. Finally, the self-dual set of relations 
(12) M = (A, B) = (A, C) = [B, G], (A,B) = (A, © = (B, © = G 
implies that M is an Abelian group. 


Let us also observe that any relation (7) implies that 2% and 8 have Abelian 
left-hand factors which are directly similar. This follows from (7) by division 
with (2, B) and application of (12). 

The dual of Theorem 1 is of considerable interest. Let %,, 8; and ©, be 
quotients with the same numerator such that 


(13) mM = [%, 9 Gi), = [B; P G)., (%,, G;), = (B:, @); = &. 


We shall then say that %, and &, are left-hand directly similar. Again it is no 
limitation to assume that in addition to (13) also the relation 


(14) Mm = [(%, Bil. 
holds. In order to analyze the content of these relations, we write 
Mm = M/D, 4%, = M/A, B, = M/B, G, = M/C, 

and find that the conditions (13) and (14) are equivalent to 

M = [A, C] = [B, C], (A, B) = (A, C) = (B,C) = D. 
These relations can, however, be written as 
(15) [M, ©] = [B, G], (M, B) = (A, ©) = (B, ©) = G, 
where we have introduced the quotients 

4 = A/D, B = B/D, © = C/D. 


According to Theorem 2 the relations (15) imply that 4% and % are Abelian 
groups. But since 


CAC’ = CBC = G, 
we also obtain that @, is Abelian. This in turn means that since 
m= © xX G, 


the group € contains the commutator group of M. To obtain a completely 
dualistic formulation of these results let us define the anti-center of a group 
as the quotient group with respect to the commutator group. We then have 
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THEOREM 3. Let %, and B, be left-hand directiy similar quotient groups. 
Then we have the relations 


M = (WM, Bl = Wh, Gh, (%, G). = (RB, G@). = G, 
where ©, is an Abelian group which is a left-hand factor of the anti-center of M. 
This theorem is the dual of Theorem 1. It shows the duality between the 
center and the anti-center of a group. The existence of such a duality was 
already observed by Speiser™ in connection with the direct product decom- 


positions of a group. Here it is explained as an instance of our general principle 
of duality. 


3. Properties of three normal subgroups. Since the Dedekind axiom is 
satisfied for normal subgroups, any three normal subgroups will generate by 
cross-cuts and unions a special Dedekind structure containing in general 28 
normal subgroups. The discussion of this structure leads us to a set of relations 
between three normal subgroups which we have analyzed in Foundations I, 
and which we shall not repeat here. We shall only make one application to 
obtain a particular theorem for groups. 

Let A, B and C be the given groups. To abbreviate we shall write 

R = [(B, C), (C, A), (A, B)I, 
R = ([B, C], (C, A], [A, B)), 
and 
T', = [(A, [B, C)), (B, C)] = ([A, (B, ©)], [B, C)), 


while T, and 7’¢ are obtained by permutation of letters. We can now prove 
for any Dedekind structure 


R = [Ta , Te] = (Ts, Te] = (Te, Tal, 
R = (T4, 7s) = (T2, Tc) = (Tc, Ta). 
This means that if we put 
Y= 7T,/R, B= T;/R, € = Tc/R, 


the conditions (12) in Theorem 2 are satisfied, and hence we have the following 
interesting 


TueoreM 4. For any three normal subgroups A, B and C the quotient group 
4 = ({A, B}, [B, C], [(C, A])/((A, B), (B, C), (C, AD] 
is Abelian. 
A study of all possible similarity relations for quotients formed by consecutive 
elements in the structure generated by A, B and C reveals that these quotient 


groups fall into seven different classes of similarity. We shall state the results 
briefly. They may all be proved by application of the law of isomorphism. 


13 A. Speiser, Theorie der Gruppen von endlicher Ordnung, 2 Auflage, Berlin, 1927, p. 136. 








174 OYSTEIN ORE 


TueoreM 5. For any three normal subgroups A, B and C there exist the similar- 
ity relations 


(16) [A, B, C)/[A, B) = [B, C]/([A, Bl}, [A, C]) 
= [C, (A, B)]/Tc = C/(C, [A, B)), 


and two others obtained by permutation. Three dual sets of similarities are ob- 
tained by interchanging cross-cuts and unions in (16). Finally, there exists the 
self-dual set of similarities 


([A, B], [A, C])/[A, (B, C)] = ((B, A], [B, C))/[B, (A, C)] 
= ([C, A], [C, B))/{(C, (A, B)| 
=> (A, [B, C])/[(A, B), (A, C)] & (B, [A, C])/[(B, A), (B, C)] 


(17) oi 
= (C, [A, B])/[(C, A), (C, B)] 


= R/T. = R/Ts => R/Tc 
=~ 7T,/R = T;/R = Tc/R, 


and all the quotient groups (17) are Abelian. 

The last remark follows from the proof of Theorem 4. 

The similarity relations of Theorem 5 actually go back to Dedekind." They 
have been restated in part by Remak” and Garrett Birkhoff.”® 

Another finite substructure of a general Dedekind structure may be defined 
by any four normal subgroups A > a and B > b. In general the corresponding 
structure contains 18 elements and the consecutive quotients fall into 7 similar 
classes. Since most of these similarity relations are trivial, we shall only men- 
tion 

THEOREM 6. For any four normal subgroups A > a and B > b there exist the 
self-dualistic relations 


[a, b]/[a, (b, A)] = [b, (A, B)]/(A, B) 
and 
([a, B), [b, A})/[a, b] = (A, B)/[(a, B), (b, A)] = fa, (A, B)]/[a, (6, A)] 
= [b, (A, B)I/[b, (@, B)). 


The last relations contain the lemma of Zassenhaus for normal subgroups. 

It may be observed finally that the preceding relations can in some cases be 
extended to quasi-normal or permutable groups by using structure isomorphism 
or correspondences instead of similarity. In Chapter 2 we have already done 
this in the case of the lemma of Zassenhaus. 
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A CLASS OF LINEAR GROUPS WITH INTEGRAL COEFFICIENTS 
By Artuur B. CoBLe 


Introduction. In the study of certain groups of Cremona transformations 
in S, which I have called “regular groups” (II, §§4, 5), the determination of the 
types of transformations in the Cremona group was accomplished by the use 
of a related linear group with integral coefficients. In later papers’ it appeared 
that the class of Cremona groups with such related linear groups is probably 
quite extensive. It is the purpose of the present paper to discuss linear groups 
of this character independently of their association with Cremona groups. 
They are generated by a finite number of involutorial elements of a particular 
type, derived in §1, and characterized more completely in §2. These groups 
divide into two classes according to the values of a certain constant, e = +1, 
and they have an additional integer parameter «. The linear groups first men- 
tioned occur when «€ = 1, e = —1. These occurred in pairs of “associated 
groups” and in §3 this isomorphism described as association is extended to 
generic e. 

Each group has an invariant linear, and an invariant quadratic, form. This 
imposes the necessary conditions on the coefficients of the generic element, 
which are obtained in §4. These conditions, however, are not sufficient. The 
de Jonquiéres subgroups, defined in §5 after the manner of the de Jonquiéres 
group of planar Cremona transformations, are used in §§7, 8 to separate the 
cases of finite and of infinite order. 

The types of symmetric transformations, already determined when « = 1, 
e = —1, are found for the general case in §6. 

A particular glass, g,(a), of these groups, whose generators are unusually 
simple, is studied in §9 with particular reference to aggregates of products of 
ternary de Jonquiéres transformations, and again in §10 with reference to the 
nature of the coefficients of its elements. The writer hopes to consider the 
more general group in a later paper. 


1. A particular type of involutorial matrix. In connection with the so-called 
“symmetric Cremona transformations” there occur linear transformations with 
integer coefficients, the matrix of the coefficients having the particular form 


a ~~. Se 

6 — —e€ —<é 
(1) _——e ag | gees 

6 —e€ —<¢ —y 


Received February 23, 1937. 

1 A.B. Coble, Point sets and Cremona groups, I: Trans. Amer. Math. Soc., vol. 16 (1915), 
pp. 155-198; II: Trans. Amer. Math. Soc., vol. 17 (1916), pp. 345-385. 

2 A.B. Coble, Groups of Cremona transformations in space of planar type, I: this Journal, 
vol. 2 (1936), pp. 1-9; II: this Journal, vol. 2 (1936), pp. 205-219. 
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where a, B, y, 5, € are positive integers and where y may be zero. An example 
is the well-known transformation 


, 
Xp = 2% — 1% — Xe — 2s, 


, 

71} => 2X — fe Be, 
(2) , 

z= M-— XN = Tks 

, 

3 = YY — WM — Xe, 


associated with the quadratic Cremona transformation in the plane. 

In the instances mentioned these linear transformations are involutorial. 
We ask then under what conditions a square matrix of type (1) and order r + 1 
is involutorial. The customary conditions yield the following: 


r= ] r >i 
a — ps = 1, a’ — rp = 1, 
(3) B(—a + y) = 0, —aB + By + (r — 1)Be = 0, 
i(—a + 7) = 0, ab — yi — (r — l)de = 0, 
—8 +7 = 1, —B+7 +(r—- Ne =1, 
—68 + Qve+ (r — 2)€ = 0. 


In the case r > 1, the last two conditions yield (y — «)” = 1, whence the entire 
set can be written as 
y=et+e (e = +1), 
(re + e)’, dla — (re + ef] = 0, 
e(re + 2e), Bla — (re + e)] = O. 


(4) a 
6p 


Since 6 and @ are to be greater than zero, we have 
(5) The matrix (1), if of two rows (r = 1), ts involutorial if y = a and 6, B are 
complementary factors of a” — 1(a > 1); if of (r + 1) rows (r > 1), is involutorial 
ify =e+e(e = +1),a=re+e, andi, Bare complementary factors of «(re + 2e), 
the caser = 2, € = 1,e = —1 being excluded. We assume also that « > 3. 
Certain particular cases of matrices (1) which are involutorial are excluded 
by the restrictions imposed in (5). These would be of no interest in connection 
with the groups about to be defined. We denote by J2..., the linear trans- 
formation on variables x2), 21, --: ,2, whose matrix of coefficients is (1) as 
limited in (5). 
It may be verified without difficulty that J, ..., has the absolute invariants 


Q = 6x) — Biri + --- + 27), 
L = rb2t%y — (a — 1)(a1 + coe + 2z,). 


(6) 


F 
: 
a 








et ha a, 8 eae 
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It has the relatively invariant linear form 


(7) L’ = rixy — (a2 + 1)(, + +--+ + 2,), 
with multiplier —1. It has also r — 1 linearly independent invariant forms of 
type x; — 2; (i,j > 0;% # 3), which are absolutely unaltered if e = —1, and 


reproduced with a multiplier —1 if e = 1. From these facts we conclude in 
geometric language that 

(8) The involution Iy2..., with matrix (1), as limited in (5), is always of the 
type perspective at a point. Given the quadratic form Q, this point, or its polar 
linear form, defines the involution. If r = 1, this polar linear form is either L or 
L'; ifr > 1, itis L or L’ according as eis 1 or —1. If r = 1, the determinant of 
the matrix is —1; if r > 1, this determinant has the value (—1)’ or —1 according 
aseisior —1. 


2. The linear groups G,(a), G,(r, «, e). Let ji, --- ,j- be any combination 
of r indices from 1, 2, --- , p (p > r), and let J;, ... ;, be the involutorial linear 
transformation on variables 2, x;,, --- , 2;, With the same matrix of coefficients 
as was assigned to J,..., in 1, the remaining variables in the set 1, --- , 2, 


being unaltered. We define the group g,(r, e, e) to be that generated by the (*) 


involutions I ;, ... ;, formed for the givene 2 lande= +1. Thus g,(r, ¢, e) is 
a group of linear homogeneous transformations on p + 1 variables. 
There is no loss of generality in setting 


(1) 5=1, B= ere + 2) 
in the generators J;, ... ;,. For if 6 ¥ 1, the transformation 
(2) to = Xo, x; = dz; (i = 1, ---,p) 


carries the group into one for which 6 = 1. We have then 
(3) Ifr = 1, the linear group g,(a) is generated by p involutions of the form 


T;: ro = at) — (a — 1)aj, rt; = Xo — az;, t, =a, (k ¥ 0,5). 
It has the invariant forms 


Q=2 — (ae —1)tit+--- +2) L=m—-— (a—1)(u+--:+2,), 


(a 2 2). 
(4) Ifr > 1, the linear group g,(r, €, e) on variables xo, 4, +++ , 2, ts generated 
by (° involutions of the form 
ro = (re + e)ao — e(re + 2e)(xj, + --> + 23,), 
eye tj, = % — e(2j;, +++: + 2;,) — ex;, (= 1,---,r), 
1% = % (k #0, ji). 
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It has the invariant forms 
Q = x) — e(re + 2)(zi + «++ + 25), 
L = rx — (re+e—1) (1 +--+ + 2,). 
Naturally, if e = 1, L may be taken in the simplified form 
(5) L = % — ea +--+: + 2,) (e = 1). 


Thus, for r = 1, there exists a g,(a@) for every a 2 2 and for every p = 2. 
For r > 1, there exists a g,(r, €, e) for every « 2 1, for every p 2 r + 1, and 
for each e = +1, except when r = 2,€ = lle = —1. 

It is clear from the nature of the generators that 
(6) Ifr <p’ < p, the g,:(r) is a subgroup of g,(r), the other parameters a, or €, e 
in the two groups being the same. 

We shall later (cf. 5 (1)) find other cases in which certain groups g,- are sub- 
groups of another group, g,. 

If we consider that the generating involutions are harmonic perspectivities 
determined by skew S; and S,-:- in the S, of 2, 41, +--+ , 2), we see that 
(7) The group g,(a) in S, is generated by harmonic perspectivities whose spaces 
of fixed points are the So with equation (a — 1)& — &; = 0, and the S,_; with 
equation x — (a + 1)z; = 0. If e = —1, the generators of g,(r, «, —1) have 
also, for spaces of fired points, the So, (re — 2)& — (&;, + --- + &),) = 0, and 
the S,1, % —- €(@j;, +++: + 2;,) = 0. If however e = 1, the generators of g, 
have for spaces of fixed points the 8,1, L = 2%, = X= +++ = M%,_, = 0, and t 
the S,_,, L’ = xj, — 23, = +++ = Xj, — 1;, = 0, where the indices k,, --- , k,, 
are complementary to ji, «+: ,j, m1, 2, +--+, p. 


3. Associated groups. In connection with regular Cremona groups and 
their attached linear groups g,(r, 1, —1) ef. "II, §5 (27)], the author has called 
attention to an isomorphism referred to as “associated groups”. This iso- 
morphism persists in some measure for the more general linear groups considered 
here. We prove first that 
(1) If p — r > 1, the group g,(r, «, —1) is the linear transform of the group 
go(p — r, €, —1), the generating involution I ;, ... ;, of the first group being the trans- 
form of the generating involution I ;,,., -.- jp of the second group. 

For the proof it is necessary only to find a linear transformation 7 of non- 
zero determinant from x to which converts the fixed So, S,1 of I;, ... ;, (ef. 
2 (7)), i.e., 


(re = Div — (Ei, He HEH tO ee to + HH), 
into the fixed Sj, S,_. of I;,,, ... i,» Which are 
\(p = rjé - 2} _ Ges +++ >} E;,), Xo — e(Zj,.; s+ > £;,). 


Since T must have the same effect for all sets of complementary indices j;, --- , j, 
and jrii, *** , jp, 7 must be symmetric in 7, --- , z,, and we set 





than ibs: 
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Zo = lo + m(x, + --+ + 2,), 
(2) z: 
Zi = nto+ p(ait:---+2,) + qx ((=1,---,p). 


The determinant of T is q’ [lq + plp — pmn]. The conditions that S), So 
and S,_,, S,-, are conjugate respectively under T and 7~* respectively are 


U(re — 2) + mrin(re — 2) + pr + q:n(re — 2) + pr = (p — roe — 2:02, 
l— &p — r)n:m — ep — r)p:m — Hp — r)p — &g = 1:—€:0. 


These four conditions on 1, m, n, p, g respectively have the matrix 


0 0 re— 2 r 1 

0 -1 0 e(p — r) é 
ver 2B FF 0 0 (pe — rl — 2 

€ 0 —e&(p —7r) 0 é. 


If / and m are eliminated from the last three, and the result simplified by using 
the first condition, we get (¢ — «)¢g = 0. Since gq, a factor of the determinant 
of T, is not zero,é = ¢«. The conditions are then dependent. Setting p = 0, 
we find that 
(3) The groups g,(r, «, —1), g.(e — r, €«, —1) of (1) are conjugate under T in (2) 
with l:m:n:p:q = —(pe — 2): e(re — 2): —1:0:(re — 2), and with determinant 
2(re — 2)’ > 0. 

If p — r = 1, ‘we seek a transformation T which converts the fixed S), S, of 
I;, --- ;, a8 given above into the fixed S,, S; of I;,,, of g-1(@), which are (a — 1) 


& + £3,,,= 0, % — (a + 1)%;,,,. The matrix of the four conditions on 1, m, n, 
Pp, Vis 


0 —1 0 a+la+l 

€ 0 —-ea+ 1) 0 a+] 

0 0 re— 2 r 1 
re—-2 rf 0 0 a-—l. 


If the first and second equations are solved for 1, m, and the results substituted 
in the last, a comparison with the third equation yields [(a + 1) — «jq = 0, 
whence e = a+ 1. Again setting p = 0, we find that 

(4) Fore = 3, the group g,+:(r, €, —1) can be transformed into the group g,4:(a = 
¢ — 1) by the transformation T in (2) withl:m:n:p:q = —(re — 2) — eze(re — 2): 
—1:0:re — 2, and with determinant 2(re — 2)"*' > 0 in such wise that generators 
I;,...;, and I;,,, of the respective groups are conjugate. 

It is clear that no group g,(r, €, e) with e = 1 can be transformed into a group 
g(a) or into a g,(r, €, e) with e = —1 in such a way that the generating involu- 
tions of the one group pass into those of the other, since, according to 2 (7), 
the fixed spaces of the generators of the one group are of different character 
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from those of the generators of the other. Nor, for the same reason, can two 
g,’s with e = 1 be transformed into each other in such fashion that comple- 
mentary generators pass into each other. Thus association is characteristic 
of the groups g,(a) and g,(r, e, —1). 

(5) The group g,(r, «, —1) will have not only the series of subgroups isomorphic 
with g,-x(r, €¢, —1) (k = 1, --- ,  — r — 1) mentioned in 2 (6) but also subgroups 
isomorphic with g,-x.-.(p — r — k, «, —1) (l= 1,---,r—1). If however ¢ = 
1,2,thenk = 1,---,p—r-—2. 

For, according to 2 (6), g,(r, ¢, —1) contains subgroups g,-x(r, «, —1) ob- 
tained by fixing k of the variables. The subgroup g,-x(r, ¢, —1) is, according 
to (3), isomorphic with g,-.(e — r — k, ¢, —1), and this latter, with 1 variables 
fixed, yields subgroups g,-»—.(p — r — k, ¢, —1). 

The distinguishing feature of the association here derived is the correspondence 
between complementary generators of the associated groups which automatically 
sets up an isomorphism. There may well be other types of isomorphism between 
two of these groups, and even other cases of conjugacy under linear trans- 
formation. 

Reverting to (3), we observe that if p = 2r, the associated groups coincide, 
whence 
(6) The groups ge,(r, €, —1) are self-associated. Each possesses an inner tso- 
morphism in which complementary generators correspond. 


4. Relations connecting the integer coefficients of the elements of the groups 
gp. In order to include both the groups g,(a@) and the groups g,(r, ¢, e), it is 
convenient to return to the notation of 1 with 

y=ete, B=cre+2), rsB=a' —1. 


We also take the invariant forms Q, L as in 1 (6). 
Let the generic element of g, be written as 


™ " Le = ceske — Gk, *** —CejZj °° —CesZp, 
I; = alo — ant --- Oj jLj +++ — aint, (i = 1, --+ ,p). 

We first observe that 

(2) ao = 1+ a8, an = 6d;, aj; = Bb; (i > 0,7 > 0). 


For these relations are satisfied by the generators, and, if satisfied by (1), are 
also satisfied by the product of (1) and a generator. Hence they are true of 
every element. 

The quadratic relations on the coefficients a;; are as follows: 


(i,j,k = 1,---, 9) 


Dd? = a, =} = do. 
(3) S,a%; = 66b3; + 1, Yjai; = 68d; + 1, 
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Lidia; ; = b;- ao, > jb jai; = d;- coo, 


D iQ Qik = 5Bb jb; Gj - k), D ji jk j = 68d di, (i = k). 


The first column of four relations expresses that Q is unaltered by T. But it 
also expresses that the matrix T’ obtained by interchanging b; with d;, also aj; 
with a;;, in T satisfies the identity 77’ = 1, whence T” is the inverse of T. 
Hence 
(4) The inverse of T is obtained from T by interchanging 6; with d;, and aj; 
with a;;. 

The second column of four relations (3) then expresses that Q is unaltered 
by T. 

If we apply T and T™ to the invariant form L, the following four linear 
relations on the coefficients of T are obtained: 


(a =“ 1)>.d; = 1 (aoo = 1), (a aad 1)> 4b; = 7 (ao = 1), 


(5) 
Liai; = (a + 1)b; — 1, 2 jai; = (a + 1)d; — 1. 


The following fact is needed in the next section: 
(6) If T ing,(r, «, e) is a product of g generators, then 


aon = (—e)? + wl(re + 2e). 


For this relation is true for a single generator, g = 1, » = 1, ao = a = re + @. 
Let us assume that it is true for 7, i.e., that ao = (—e)’ mod(re + 2e). It is 
then true for the product 7’ of 7 and an additional generator, since ajo = 
am(re + €) — e(re + 2e)(di, + --- + d;,) and ago = aw(re + €) = aw(—e) = 
(—e’)(—e) = (—e)’™ mod(re + 2e). 

The corresponding theorem for g,(a) is 
(7) If T ing,(a) is a product of g generators with determinant A = (—1)’, then 
ao = A+ ula + 1). 

This indeed is true of the generators themselves, and it remains true when 
they are combined. 

A further property, used in 9, of 7’ in g,(a) is 
(8) The value u defined in (7) satisfies the relation 


uw — 2b; = 20 (A integral and 2 0). 


To prove this we consider the product TJ; with coefficients ajo = (n — 2)aw — 
(n — 1)(n — 3)d;, b; = (n — 2)b; — aj;. Since, for the product, A’ = —A, 
ao = (n — 2)4 + (n — 2)u(n — 1) — (n — 1)(n — 3)d; = —A + (n — 1) 
{A + u(n — 2) — (mn — 3)d;}, whence vp’ = A + u(n — 2) — (n — 3)d;. Also, 
according to (5), 2b; = (n — 2)2,b; — (n — 1)d; + 1. Hence yp’ — >; = 
(n — 2)(u — 2b) +A — 1+ 2d;. Since A — 1 is even, x’ — ¥{b; is even if 
u — X,b; is even. But, for a single generator, 1 = 1 and jb; = 1, whence 
a — Xb; is even, and remains even under multiplication of the generators. 
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5. De Jonquiéres subgroups of g,(r, «, ¢). We have found in 3 that when 
e = —1, the groups g,(r, «, —1) and g,(p — r, «, —1) are simply isomorphic in 
such fashion that, if 7, --- 7, and j; --- j,-, are complementary sets of indices 
from 1, ---,p, the generators J;, ...;, and J;, . of the two groups cor- 
respond. If we take in g,(p — r, ¢, —1) that subgroup mentioned in 2 (6) which 
leaves 2, --- , 2, unaltered, the isomorphic subgroup in g,(r, «, —1) is generated 
by involutions J... x:, ---:,.,, Where the indices 7, --- 7, are selected from 
k + 1,---,p. But this set of generators will generate a subgroup whether 
ebe lor —1. We call this type of subgroup a de Jonquiéres subgroup because’ 
of its first appearance in connection with de Jonquiéres planar transformations. 
(1) The de Jonquiéres subgroup of g,(r, €«, e) generated by involutions the 
The ... ki, «-- i,-4 18 Simply isomorphic with g,-x(r — k, €, e) (k <r). 

It is clear that, due to the relations z,, — x, = (—e)’(tm — tn) [m,n = 1, ---, 
k; m < nj, which persist under repeated application (g times) of the given 
generators, the elements of the subgroup will have like rows, 2;, ---,2,, except 
in the rectangle of coefficients under 2, ---,2;. In this rectangle of co- 
efficients the principal diagonal terms will have the additive part (—e)’. It 
will thus be sufficient to have the sum of these rows and we introduce the 
variable 
(2) o= ates +r. 

A typical generator then reads 
xy = (re + ejay — ere + 2e)o — (re + 2e) 
(ten +--+ + 27), 
(3) ee eee a’ = kxo — (ke +- e)o ~—_ Ke(tiess + eee _ Ze), 
tj = % —  — ety + °°: +2) — C2; 


(j =k+1,---,7r). 


This transformation on the r — k + 2 variables 2, o, x; is itself involutorial. 


°* Ig~e 


With generators of the type (3) we make the following change of variables: 
t = kay — (re + 2e)o, 25; = 7; (j=k+1,---,p). 


(4) 


Zo = Xo — eo. 
The transformation inverse to (4) is 
{(r — k)e + 2e}ao = (re + 2e)a — et, tj; = 2, 
©) {(r — k)e + 2elo = kay — tt. 


In these new variables the equation of the generator (3) is 


= (—e)t, 2) = 21 (l=r+1,---,p), 


(6) zo = {(r — k)e + e}zo — ef (r — koe + Qe} (ens + --- +2,), 
2; = 2% — (2x1 + ++ + 2,) — €2; j 
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Thus to the generator (3) of the de Jonquiéres subgroup there corresponds a 
generator I,,:, ...,, of the g,x(r — k, ¢, e), and thereby to every element of the 
one group there corresponds an element of the other. 

It is of interest to have the expression for the element of g,(r, ¢, e) which 
corresponds to a given element of g,x.(r — k, ¢, e). Let then the generic ele- 
ment of g,-.(r — k, €, e) be 


20 = Bozo — = jBoiz;, 

2; = Bito — 2 ,Bis2; (i,j =k+1,---,0), 
and let it be generated by g generators. According to 4 (2), (6) 

(8) Bo = (—e)” + u'{(r — k)e + 2c}, Boy = {(r — ke + 2ehedj. 


We have from (5) that {(r — k)e + 2e}xo = (re + 2e)z) — et’. On replacing 
zo in this from (7), and t’ by (—e)"t [ef. (6)], then making use of (4) and applying 
the relations (8), we can factor out {(r — k)e + 2e} to obtain the value of 24 in 
(9). The value of o’, similarly obtained, is 


o’ = kp'xo — [eku’ — (—e)’]o — ked jb;2; (Gj =k+1,---,p). 


In the light of our introductory remarks this yields the value of z; in (9). The 
remaining formula in 


8 
— 
I 


[(—e)’ + u'(re + 2e)]aro — e(re + 2e)u’(ti +--+ + a) 
— (re + 2e)D,bjz;, 
(9) a2; = pao — en’(x, + --- + ax) + (—e)’ a — ed; 2;, 
= Buto — Bu(ti +--+ + te) — 2 Brix; 
(G§ =1,---, kj j,l =k +1, --- yp) 


& 
| 


is almost immediate. Hence 
(10) If a generic element of g,-x(r — k, €, e) is given as in (7), (8), then (9) is a 
generic element of a de Jonquiéres subgroup of a g,(r, «, e). 

In another connection I have called this process of passing from a given group 
to a subgroup of a more comprehensive group, the “dilation” of the given group 
(ef. *§4]. 

In the above we have set the limit k < r. If however k = r — 1, the g,-x 
(r — k, ¢, e) would be a g,_x(a) with only one free variable in the generators. 
This case would however be included under the above argument by setting 


g,-x(1, €, e) = Jo—K (a = a + e), 


« 23ife = —1, e2life=1. 


(11) 


+A. B. Coble, The ten nodes of the rational sextic and the Cayley symmetroid, Amer. Jour., 
vol. 41 (1919), pp. 243-265. 
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6. Symmetric transformations and related theorems. It may happen that 
the groups g,(a), g,(r, «, e) contain other elements of the type 1 (1) than the 
generators. Such elements are called “symmetric transformations’. Thus 
the g,(3, 1, —1) generated by elements of type 1 (2), whose elements represent 
the effect of Cremona transformations in the plane, contains three such sym- 
metric transformations corresponding respectively to Cremona transformations 
of orders 5, 8, 17 with 6, 7, 8 F-points of orders 2, 3,6. Such symmetric trans- 
formations in a given group will generate a g,(r’, e’, e’) which is a subgroup of 
the given group and which therefore has the same invariants, Q, L. ‘ 

We cannot assume that 6 = 1 in these symmetric transformations, and will 
therefore denote them more specifically by I(r’, a’, 5’, B’, «’, e’), where a’ = 
r’e’ + e’, and 6’B’ = e'(r’e’ + 2e’). Consider first the Q, L of g,(a) in 2 (3). 
They will be unaltered by this J if 


(e’ +1)=8(a+1), BB’ =(ae—1)8, a@& —1=Pr'i'(a — 1). 


From the first and last we see that 6’ is a divisor of 2. If 6’ = 1,a’ =a,r’ = 1, 
and 8’ = a — 1. This yields merely a generator J, of g,(a). If 6’ = 2, a’ = 
2a + 1, and a = r’(a — 1), whence a = 7’ = 2,8’ = 6. From e’(r’e’ + 2e’) = 
B'S’ = 12, and a’ = r’e’ + e’ = 5, we get (5 + ce’) = 12, 2e’ + e’ = 5. Thus, 
for e’ = +1, we have two solutions, e’ = —1, e’ = 3,ande’ = 1, e’ = 2. These 
solutions yield two matrices of type 1 (1): 


5 -@ ~@ 5 ~6 <6 
(1) M:2 -2 -3 M’:2 -3 -2 
$ +3 <2; oS ~9 + 


If we examine the group g2 (a = 2) with generators J,, J:, we find that I,J, 
has the period three, whence g2(2) is dihedral of order six. The remaining 
element of period two is IyJoI,; = I2JiJ2 with matrix M. Naturally an element 
with matrix M’ will not occur in the g2(2)._ Hence 

(2) Theg,(2) is the only g,(a) with a symmetric transformation. The g,(2,3, —1) 
generated by this transformation with matrix M is a subgroup of g,(2). In par- 
ticular g2(2) is dihedral of order six, and M is in the conjugate set of three involutions. 

If the g,(2, 3, —1) were amplified by introducing the permutation group of 
41, °** ,2,, the larger group so obtained would contain the g,(2, 2, 1) generated 
by involutions M’, since M’ is M followed or preceded by the interchange of 
21, %2. We have not of course proved that g,(2, 3, —1) does not contain II,,. 
This however seems to be unlikely. We prove later that, when « = 1, either 
II,; or its even subgroup is contained in g,(r, 1, e). 

Suppose now that r > 1, and let g,(r, «, e) be generated as in 2 (4). We ask 
for generating involutions with r’ = r, e’, e’, and 6’, B’ = '(r’e’ + 2e’), which 
leave the Q, L in 2 (4) unaltered. In order to bring in a, a’ we observe that, 
since e = e” = landa = re+e,a' = re +’, 
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re(re + 2e) = rB = a — 1, 

r'el(r’e! + Qe’) = r'8'B’ = a” — 1. 
It is also convenient to write Q, L as 

rQ = rx} — (a* — 1)(ai + --- + 24), 

L = rx — (a — 1)(41 +--+ + 2,). 
Then the conditions for invariance ultimately reduce to the following three: 

(r’ — r)(a’a — 1) = (a’ — a)(r’ +17), 

i’ = (e’ + 1)/(a + 1), rp’ = (a’ + 1)(a — 1). 

Since r’ 2 r, we first consider r’ = r. Then from (3), a’ = a, 6’ = 1, B’ = 8B. 
Also, from a’ = a, we have r(e’ — €) = e — e’. The obvious case is ¢’ = «, 
e’ = e, which yields no new generator. But it may be that e’ = —e, or r(e’ — «) 
= 2e. Sincee = +1, andr 2 2,thenr = 2,¢ =~e+e,¢’ +e’ =ec. Thus 
the new involution is merely the generator Ij. with the last two rows (or the 
last two columns) interchanged as M, M’ above. As noted above, g,(2, «, —e) 
generated by involutions Jj. is not the same as the g,(2, «, e) generated by in- 
volutions I. We also have excluded in 1 (5) the case « = 1,e = —1. 


Suppose now that r’ > r, say r’ = r+i2(i 21). Suppose also that e = 1. 
Then the first equation (6) reads 


(3) 


2. 


a’(te — 2) = —e(2r + 1) 


Hence ze — 2 < O,ie.,i = lle = 1. Hencea=r+1,r=r4+i,a’ = 
2r+ 3,6 = 2,8’ =2r+4. Froma’ = 2r+3=r'e +e’ = (r+ 1c’ +e’, we 


have e’ = 1, e’ = 2, since e’ = —1 would yield a fractional «. This is the case 
(c) of Theorem (4). 
Suppose that r’ > r as before, and that e = —1. Then the first equation 
(6) reads 
a’(ire — 2r — 2i) = —2r(re — 1) — i(re — 2). 


Since r > 2,¢€ 21,7 2 1, then ire — 2r — 22 < 0. If t,t’ are respectively the 


larger and smaller of r, 7, then e¢’ — 4 < Oore 23. Ife = 3,’ =i = 1, 
sinceer > 2. Butthenr —2 <0. Againife = 2,t? =i=1. This yields a 
case, e = 2,r° = r+1,a’ = 2° — la = 2r — 1, 8 = 71,8 = 4r(r — 1), 


listed as (d) in Theorem (4). From a’ = r’e’ + e’, or 2r 
we find that e’ = 1, e’ = 2(r — 1). 

There remain the cases for which e = —1 and e = 1. These however are 
the groups g,(r, €, ¢) attached to the groups of “regular Cremona transforma- 
tions” [ef. ‘II, §4], and for these groups the symmetric transformations have 
already been determined [ef. “II, §5, pp. 368-9]. These are listed in Theorem 
(4) as (e), --+ , (h). 


—-1l=(r+ 1) +2, 
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(4) The following sets of involutions I(r, a, 5, 8B, €, e) have the same invariant 
Q, L: 

(al): (1, 2, 1,3, —, —) (b1): (2, 2e + e, 1, 2e(e + @), «, e) 
(a2): (2, 5, 2, 6, 3, —1) (b2): (2, 2e +e, 1, 2e(e + e), e +e, —e) 
(a3): (2, 5, 2, 6, 2, 1) 


(cl): (7,r + 1,1, r + 2,1, 1) (dl): (r, 2r — 1, 1, 4(r — 1), 2, —1) 

(c2): (r+ 1,2r+3,2,2(r+2),2,1)  (d2): (r +1, 2r°— 1,7, 4r(r — 1), 
2(r — 1), 1) 

(el): (3, 2, 1,1, 1, —1) (f1): (4, 3, 1, 2, 1, —1) 

(e2): (6, 5, 2, 2,1, —1) (f2): (6, 7, 2, 4, 1, 1) 

(e3): (7, 8, 3, 3, 1, 1) (f3): (7, 15, 4, 8, 2, 1) 


(e4): (8, 17, 6, 6, 2, 1) 


(gl): (5, 4, 1, 3, 1, —1) (hl): (2k, 2k — 1,1, 2(k — 1), 1, —1) 

(g2): (8, 49, 10, 30, 6, 1) (h2): (2(k + 1), 2k — 1, k, 2k (k — 1), 
k — 1,1). 

We have seen that the cases (a), (b) were not of special interest, and that cases 

(e), --- , (h) have been discussed. We consider then more particularly the 


cases (c), (d). In both we pass from 2, ---,2, to %1,°++,2%r43. Let then 
J, be the generating involution J; ,..., .1,441,...,.41 (kK = 1,---,r +1). With 
B = e(re + 2e), the product J,.:J, has the matrix 


1+ 8 —Be ~fe +++ =f +6) —B 

€ —(é — 1) —é +++ —é(e + e) —e€ 

(5) JniJe:  ¢ —— 38+ = 1) «-- eee —e 
1 —e —e —(e + e) 0 


ete —eet+e) —e(e +e) --- —ee + 2c) —(e +e). 
If in this « + e = 1, since e > 0,e = +1, thene = 2,e = —1, 8B = 4(r — 1), 
as in case (dl). From an inspection of the last two rows and last two columns 
it is clear that then 
(6) J raid = J J 41 ° 
Hence 
(7) In the case (dl) of (4), the generating involutions are permutable. Their 
product JiJ_ .-+ Js: is @ symmetric involution necessarily of type (d2). 
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In connection with the cases (c) we use the element 


a’ —B(ee + n) —B(ee + ) 
—B(2ee — e + n) —B(e + e) 
e+ —(&e +e + en) —e(ee + n) 
—(e+e)(e+ 7-1) —e(e + ©) 
e+n —e(ee + n) —(ee + e+ en) 
(8) JrsiJ rd 41: —(e+ e(e+- 1) —e(e + e) 


Ce 


—(e+e)(2e —e+n-—1) —e(e + 2c) 
e+e —e(e + e) —e(e + e) 
—e(e + 2e) —(e+e), 


where a’ = re + e+ Ble +e —1),7 = 6 —€4+ 1,8 = ere + 2e). This 
matrix will be of the typically involutorial form only if the elements to the right 
of «’ are all equal. This requires 


etesmet+ f —e4+1 2 Be -—-e+¢ —e+1, 


whence «€ = 1. Then also the elements below a’ are equal. Moreover the last 
two elements in the second row must be equal, which requires further that e = 1. 
Then (8) is the involution J,,... , 41: described in (c2) of (4), except that either 
the last two rows or the last two columns are interchanged. Hence 


(9) J J raid = J aad rd ras = hh, sas eae r + 1) = (r, r + |? ee 


From this there follows at once that 
(10) In the gr4i(r, 1, 1), the product JiJ; (i,j = 1, --+ ,r + 1) has the period 
three, and the transform J ;J ;J ; the period two and matrix of type (8). This linear 
group contains the subgroup of even permutations of the variables 2, +++ , 2,41, 
the cyclic elements, J;J ; and (ijk), of period three, being in the same conjugate set. 

For JJ nid, = Ih, eee vrai (r, r+ 1), Jao aid, = (r, r- 1)-hh, ses rely 
whence J,-J ad pads = (7,7 — 1)-(7,7 +1) = ( -— 1,rtil,r). 

An examination of the matrix (8) for the case « = 1, ¢ = —1 shows that then 
J aad J rua = (r, 7 +1). Hence 
(11) The group g,4:(r, 1, —1) contains as a subgroup the permutation group 
Thirsty: Of 21, °** , tras, the transpositions (x,x;) being in the same conjugate set 
as the generators. 

Since g,-(r, €, e) is a subgroup of g,(r, €, e) [r < p’ < pl], there follows that 
(12) The groups g,(r, 1, e) (o > r) contain as a subgroup either the symmetric 
group II,;, or the alternating group Il,1/2, of the variables x,, --+ , x, according as 
e= —lore=1. 
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For e = —1, this was already known in connection with the regular Cremona 
groups. 


7. Finite and infinite groups g,(a) and g,(r, «, e). There is a very simple 
geometrical criterion which separates the finite and infinite cases of the groups 
under consideration, namely: 

(1) If in the space S,(x0, 21, +--+ , 2p), the linear space L = 0 cuts the quadric 
( = 0 in a quadric without real points, then the group g, is finite; otherwise, it is 
infinite. 

To apply this criterion we write Q = 0, L = 0, using the notation of 1 (6) 
in the form aj + --- + x = 6/B, (1. + --- + 2,)/~V/p = ri/(a — 1)Vp. In 
this metric form the condition given in (1) is obviously 6/8 < r°6’/p(a — 1)’. 
This reduces by virtue of r68 = a’ — 1 and a > 1 to the form 


(2) p(a — 1) < r(@ + 1). 

We divide the cases as before into 

(2a) r=1, p< (a+1)/@—- 1); 

(2b) r>1l, e=-l, p<re/(re — 2); 
(2c) r> i, e=1, p < (re + 2)/e. 


In the case (2a) with p = 2,a = 2and p < 1 + 2/(a — 1), there is only one 
solution p = 2, a = 2. 

In the case (2b) with p > r > 1, e 2 landp < r+ 2r/(re — 2), since p — r 2 
1, then 1 < 2r/(re — 2), or r(e — 2) < 2. Hencee = 2ore=1. Ife = 2, 
p<r+1+1/(r7—lhorp=r+1. Ife = 1, the caser = 2 being excluded, 
p<r+2+4/(r — 2). Thus forr = 3, p < 9;forr = 4, p < 8; forr = 5, 
p< 9;andforr >5,p=r+1,r4+2. 

In the case (2c) with p > r > 1, « 2 1, and p < r + 2/e, there is only one 

solution e = lp =r+1. 
(3) The groups g,(a), g(r, €, €) are finite in the following cases: (a) g,(a) = 
g2(2); (b) g,(r, «, — 1) fore = 1,2 andp = r+ 1; fore = landp=r+2; 
fore = 1,r = 3, and p = 6,7, 8; fore = 1,r = 40nd p =7;fore=1,r=5 
and p = 8; (c) g,(r, «, 1) fore = 1,9 = r+ 1. Inall other cases the groups are 
infinite. 

In this section we prove that the cases mentioned are infinite, leaving a dis- 
cussion of the finite cases for the next section. We observe first that the cases 
(b) for « = 1 are known in connection with the regular Cremona transformations. 
It has been proved [ef. ‘II, §5, pp. 375, 377] that go(3, 1, —1) and gs(4, 1, —1) 
are infinite. Hence go,.(3, 1, —1) and gsi:(4, 1, —1) also are infinite [cf. 2 
(6)]. According to 3 (3), their associated groups, gs4.(6 + k, 1, —1) and gs,, 
(4 + 1,1, —1) are infinite. Again by the use of 2 (6), we see that go;i4m(6 + k, 
1, —1) and gg,...(4 + 1, 1, —1) are infinite. But these cover all the cases (b) 
for « = 1 except those asserted to be finite. 
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There remain to be considered only the cases 
(di) g(a) = gs(2); (de) goa 23); (ds) grso(r, 2, —1); 
(ds) grui(r, € > 2, —1); (ds) grae(r, 1, 1); (de) @raa(r, € > 1, 1). 


In each of these cases we have selected the smallest value of p which yields an 
infinite group, the larger values being accounted for by 2 (6). 

In the case (di), with g;(2) and its generators defined as in 2 (3), it is easy 
to verify that the form 


M;, = (3k — 1)°xo — (3k — 1)(8k — 2)a, — 3k(3k — 2)a2 — 3k(3k — 1) 


is transformed by (I eI)’ into Mis,, whence (I,IeI3)° has not a finite period, 
and g3(2) is infinite. 
In the case (dz) with a = (a — 1) 2 2, we define sequences of polynomials 
P,., C; {ef. ‘p. 111] by the recursion formula 
( ) P, = a Py — Pry, P_, = 0, P, = 1, P, = a; 
4 
Cy = Pe t+ Pa. 


As immediate consequences of these definitions we find that 
aP;; —_ Cas => P,, (a —_ 1) Px — al 4 = C;, +1, 
aP,_; ~ Cy = (a’ —_ 1)Pr1 - al, — Ca.8- 


(5) 


If then we define the forms 
(6) dD; = Py_1Xo Saad Cry = Cy-122, E, = Py1Xo - Cr321 —_ Cyr, 


it is easy to verify from (5) that J; in ge(a@) interchanges E,_,, D,, and I» inter- 
changes D,_:, E; whence J:J, transforms D, into D,,2 with respective leading 
coefficients Pas, Pry. From (4) it follows that P, — Py = Py —_ Py-2, 
whence, since P; — Py = a — 1 2 1, the polynomials P; continue to increase 
and J;J, has not a finite period. Thus go(a@ 2 3) is infinite. 

In the case (ds), g,42(r, 2, —1) is associated with, and isomorphic to, g,+2 
(2,2, -—1). Sincer 22,r+2 24. In any case g,,2(2, 2, —1) contains sub- 
groups g,(2, 2, —1) and it is sufficient to show that g,(2, 2, —1) is infinite. Con- 
sider the form 


JI, = ‘et ') to — (2k° — 1) 1 — 2k 22 — a(*t ') (xs + 2%). 


It is easy to verify that the element J 1273, of gs(2, 2, —1) transforms J; into 
Jit. Hence this element is aperiodic and g,(2, 2, —1) is infinite. 

In the case (dy), g-4:(r, € > 2, —1) is associated with g,.;(a 2 2) [ef. §3 (4)}. 
Since r = 2, g,4:(a) is either g3(a) or contains subgroups g;(a). But, according 
to cases (d,) or (de), gs(a = 2) is infinite. 


‘S$. F. Barber, Planar Cremona transformations, Amer. Jour., vol. 56 (1934), pp. 109-121. 
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In the case (ds) of g,,2(7, 1, 1) denote the generator J;, ...;, by Ji,., i,..- 
Since r 2 2,r + 2 2 4, and generators J,1,,, J:41,-42 With non-overlapping 
indices exist. Again it is not difficult to verify that the form 


Li = (*) [tro — m1 — +++ — a,-2] — 8k 2-1 — (8K —1)2, 


2 
2k 
—4 (3) (Xr41 + 2r42) 


is transformed by (J,-1,--J+41, +42) into Ly.,. Hence this element of the group 
is aperiodic and the group is infinite. 
In the case (dg) of g.41 (r, € > 1, 1) let Tj, ... 4, = Ji,,,. The linear form z, 
transformed alternately by J,.:, J, yields the sequence 
[to — err + ++ + 24)] — (€ + 1)z, 
(e of 1)[20 —_ e(2, i os Cah —_ (e + 1)2, = e(e€ a 2) 2r41, 


e(e + Ito — (zr, +--+ + 2-1] — (C+ 1) + € — 1)t, — ee + 2) 2041, 


These forms have the type 

alo — (21) + +++ + 2y-1)] — da, — cds, 

(e+ 2)a-—-b-—c=1. 

If (7) is transformed by J,,; , it acquires coefficients a’, b’, c’, where 
(8) Jryiz:a’ = (1+ 6a —b, b’ = e(e + 2)a — (1 + ©)b, ce’ =. 
The interchange in this of b, c and of b’, c’ gives the effect of J, upon the form 
(7). But this J, and J,4; in (8) are the generators J;, I, of a ge(a), where 
a = 1 + € = 3 and in which also 8 = 1,5 = a — 1. Thus, according to the 
result in case (de), the effect of J,J,,; upon forms (7) is aperiodic, whence this 
element J,J,4: is aperiodic and g,,:(r, « > 1, 1) is infinite. 


(7) 


8. Finite groups g,(a) and g,(r, «, e). In this section we verify the finiteness 
asserted in 7 (1), (3) and determine the nature of the finite groups. We take 
up the cases as they are listed in 7 (8). 

With respect to the case (a), or go(2), we have seen [ef. 6 (1) et seq.] that this 
is a dihedral group of order six. 

With respect to the case (b) we observe that, apart from g,4,:(r, 2, —1), all 
the instances have the form g,(r, 1, —1) and therefore have been found in con- 
nection with regular Cremona transformations. We list these in order. 

The g-si(r, 1, —1) (r > 2) is a gos2): isomorphic with the symmetric group on 
r + 2 things. This is the linear group associated with Moore’s cross-ratio 
group of Cremona transformations [ef. *]. 


®’ E. H. Moore, The cross-ratio group of n! Cremona transformations of order n — 3 in flat 
space of n — 3 dimensions, Amer. Jour., vol. 22 (1900), pp. 279-291. 
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The g,42(7, 1, —1) (rF > 2) is a gorse); ar+1. It has an invariant Abelian go,.. 
whose factor group is isomorphic with the symmetric group. If r = 2p, it 
has an invariant ge whose factor group of order (2p + 2) !-2°” is isomorphic 
with the collineation group which transforms into itself the collineation group 
of order 2°” induced on a hyperelliptic Kummer p-way by the addition of the 
2°” half periods. If r = 2p — 1, the group is a subgroup of the one just de- 
scribed [ef. *]. 

The self-associated [ef. ‘II, §5] ge(3, 1, —1) is isomorphic with the gsisio of the 
27 lines on a cubic surface [ef. ‘II, §3). 

The two associated groups g:(3, 1, —1), g:(4, 1, —1) each have an invariant 
gz Whose factor group of order 8! 36 is isomorphic with the group of the double 
tangents of a planar quartic [ef. ‘IT, §3]. 

The two associated groups gs(3, 1, —1), gs(5, 1, —1), each have an invariant 
gz whose factor group of order 10! 96 is isomorphic with the group of the tri- 
tangent planes of a space sextic of genus four on a quadric cone [ef. ‘II, §3]. 

In the last instance under case (b), according to 6 (7), the generating involu- 
tions, Ji, --- , J-4,, are permutable and the finite g,:(r, 2, —1) is abelian, of 
order 2°**, and type (1, 1, --- , 1). 

There remains only the case (ec) of g,4:(r, 1, 1). The theorems 6 (8), (9), (10) 
indicate that this group is isomorphic with a symmetric g;-;2);.. We make this 
proof more precise by using the set of r + 2 linear forms: 


m, = —(r + 1)to + (7 + 2)(ti + +++ + 241), 
(1) m; = % — (r + 2)z; @=1,---,r+1), 
my + my + ++ + ys = 0. 


We find that J,,; changes the sign of each of m,, --- , m,, and interchanges 
m,, Mrz, With a change of sign in each. Moreover, the transforms of r + 1 
of these forms, and the invariance of L, defines the transformation. Hence 
gr4i(r, 1, 1) is isomorphic with the permutation g;-;2): of these r + 2 forms. 
The form m,, is the form —L’ of 1 (7) formed for the symmetric element J used 
in 6 (9). The g,4:(r, 1, 1) does not contain the element J. If this were added 
as an additional generator, the doubled group with invariant g. = 17 would 
contain the odd as well as the even permutations of 7, «++ , 4:1. 

The geometric criterion given in 7 (1) for finiteness recalls the theorem of 
Minkowski ("p. 185) that a positive quadratic form in n variables cannot admit 
more than (2"*' — 2)" integral linear transformations. Hence it can admit 
only a finite group of such transformations. Yet this theorem can not be applied 
here directly. For if, say, 2 is eliminated by using the invariant linear relation 
L = 0, our group is converted into one which has rational coefficients [ef. IT, §6]. 


* A. B. Coble, A generalization of the Weddle surface, etc., Amer. Jour., vol. 52 (1930), 
pp. 439-500. 
7H. Minkowski, Geometrie der Zahlen, Leipzig, 1896. 
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9. The generic element 7’ of the group g,(a). We have in 4 taken this ele- 
ment of the group g,(a@) generated by 7; (i = 1, --- , p), 

’ ro = at) — Bri, r= 6%) — ax, a=2; (j=1,--:, 937 ¥%). 
a) a 22, 68 = a — 1, 

in the form 

(2) Ee x, = Amolg — AmiL1 — *** — AmpLp (m - 0, l, seis »p)- 
We have also obtained certain numerical properties [4 (2)] of these coefficients, 
and certain quadratic and linear relations [4 (3), (5)] satisfied by them. We 
seek to prove that, with one type of exception noted below, the coefficients 
mn (m,n = 0,1, --- , p) are positive integers or zero. If 7 could be attached 
to a type of Cremona transformation, this could be inferred from the fact that 
geometric multiplicities are positive or zero. We proceed then to define a type 
of Cremona transformation determined by T. 

The ternary de Jonquiéres transformation of order n has a set of 2(n — 1) 
simple F-points and a single F-point of order n — 1. Its effect upon curves 
with order t), and multiplicities yi, --- , yern—1) at the simple F-points, multi- 
plicity t; at the (n — 1)-fold F-point, and multiplicities &,--- ,t, at p — 1 
further ordinary points is expressed by the equations: 


to = nto —_ (y + = + Yocn—1)) =; (n —- It, 


Yi =bh-—y—h (@¢ = 1,2,---,n— 1), 
(3) os: , 

t, = (n — 1)to — (yi + ++ + Yxn_») — (mn — 2)hi, 

ti = ¢; (j = 2,---,p). 


The ¢’, y; are multiplicities at the inverse F-points of the transformation. If 
now we form products of such transformations, always taking the simple F- 
points of the last factor at the 2(n — 1) inverse F-points of the preceding product 
which arose earlier from the simple points of the factors of this product, and 
not at any time allowing more than p additional F-points to appear from the 
(n — 1)-fold F-points of the factors (i.e., forming products as though the 2(n — 1) 
F-points could be fixed for all factors), we then secure an aggregate of types of 
products whose effect upon curves is given by the aggregate of elements of the 
group generated by J,, --- , J,, each J being formed like J; in (3). 
Since y:, +++ , Yon—1) Occur symmetrically in all of the J’s, let 


(4) = Yt Yo t +++ + Yan. 
Then (3) takes the form 


th = nb —o —(n— 1)t;, 


ao’ = An — 1)tp — o — 2A(n — 1)ky, 
ti = (n — 1) — o — (n — 2h, 
tj = t; G = 2, ---, 9). 


(5) 
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- This also is an involutorial transformation on the variables ¢, t. We now 
make the change of variable 
; (6) 2r = —o + BW, r+a2;=t; (j = 0,1, ---, 9). 


Then all of the generators J take the typical form 
ro = (n — 2)am — (n — I)r, 
(7) x, = (n — 3)xo — (n — 2)x, 


, , 


, 
e x=2, 2 = @, win Lp = Zp. 


; But (7) is the generator J, of g,(@) amplified by x’ = z, where 
t (8) a=n—2, =n—3, B=n-—1, and =a —1. 
° Hence 

(9) The generic type of ternary Cremona transformation which is a product, 
) formed as indicated above, of de J onquiéres transformations of order n is isomorphic 
. with the generic element (2) of the group g,(n — 2). 


Suppose that the generic element T of g,(n — 2) with constants (8) is given 
as in (2) with the necessary conditions on its coefficients as determined in 4 (2), 
(7), (8), namely 


aso — 1 = a(n — 3)(n — 1), aio = (n — 3)d;, 
’ (10) oO; = (n <_ 1)b; (i,j = a ser ty »P)» 


a —-A= u(n = 1), oo > jb; = 2. 





. We seek to determine the corresponding type of ternary Cremona transforma- 
f fs tion. From (6) and (2) we find that 


, 7 . 
to = x’ + x = © + ants — Dja0jX; = F + allo — x) — Vja0,(t; — 2) 


“Oia 


SRI La 








= 2[1 — aw + Zja0;] + aoole — = ja0;t; 
. = 3(—o + 2to)[1 — a + = ja0;] + cool — = jarjt;. 
) 4 On substituting for a and a; from (10), we have for ¢, and similarly for the 
j E other variables, 
to = {1 + (m — 1)E:dj}to — [ZN t --* + ym) — (n— 1)ZHe4,, 
yi = {ZDi jlo — HE; — wi +--+ + Yama) + Ay — Ziti 
(11) {i= 1,---,2(m — 1), 
th = {(n — I)di}to — diy + -*+ + Yow) — Dyrans-ty 
[kK = 1,---, 9}. 


—o + 2, 


. ae . . . 
This value of y; is determined indirectly from o’. For, —o’ + 2t 
whence 


) Te Se 


o = {2(n - 1)Z bj} _~ (25 jb; _ l)o — 2(n “— 1)> b,j. 
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Now @’ is obtained from y; + --- + yocn—1, and the values of the y’’s are the 
same except in the matrix of coefficients of y:, --- , Yana. This square matrix 
has like elements —m except in the principal diagonal where they are —(m + 1). 
On comparing the o’ arising from such rows with that just given, and noting 
that 


2230; — 1 = 2A(n — 1)(3(2; — w)} — A, 


we obtain the y; given in (11). Since =,b; — pis even [cf. 4 (8)], the coefficients 
in (11) are integral. 

(12) The transformation (11) defined by the coefficients a;; of T in g,(n — 2) 
represents a geometrically existent ternary Cremona transformation, the product 
of a set of de Jonquiéres transformations. It is formed from generators J in (3) 
precisely as T is formed from generators I. 


Since (11) is attached to a geometrically existent transformation, its co- . 


efficients have a variety of properties which can be transferred at once to the 
coefficients of T which are found in (11). We recapitulate some of these: 

(13) (a) The coefficients a;:; of the generic transformation T of g,(a = n — 2) 
with 6 = n — 3, 8 = n — 1 are positive integers or zero with the exception that if 
ai; = — 1(t > 0,7 > 0), then every coefficient in the same row and column as aj; is 
zero. (b) With every set of x equal numbers b; there is associated a set of x equal 
numbers d;. The x columns and x rows of the matrix of coefficients thus isolated 
have in common a square matrix M, whose elements are all —m except that in each 
row and column of M, there is one element —(m + 1). Otherwise the x columns 
are identical and the x rows are identical. (c) For a particular element a;; (i > 0, 
j > 0), the inequality (n — 1)(b; + di) S 1 + (nm — 1)2b; + aj; ts valid. 

This last inequality arises from the known inequality (ef. ‘II, p. 368), ¢; + pi 
< m+ aj;, which occurs in connection with a ternary transformation of order 
m which possesses an F-point of order p; and of multiplicity a;; on a P-curve 
of order ¢;. 

We have just used the ternary transformations to obtain properties of the 
group g,(a), but clearly the reaction is mutual, as is expressed in (12). Thus 
one may read off from the rows and columns of coefficients in (11), the char- 
acteristics of homaloidal nets and of P-curves in terms of the rows and columns 
of the comparatively simple elements of the group g,(a). We may then expect 
to find applications of the groups g,(a) and g,(r, e, e), not merely in connection 
with new space transformations and groups where they first were observed, 
but also in connection with aggregates of transformations already studied. 

The underlying restriction a = 2, applied in the above account, yields n — 2 
=2orn = 4. It is interesting to notice that in the excluded cases n = 2, 3 
the products of de Jonquiéres transformations as used above may be regarded 
as the elements of a ternary Cremona group. For if the 2(n — 1) simple F- 
points of the direct and inverse transformations could be superposed at positions 
Pi, *** » Pansy, fixed for all the generators, these generators would yield an 
actual group. Thus, if, for n = 2, p;, pe are fixed at the circular points and 








PE 
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le . . . . . 
Ps, Ps is variable, the group of inversions is determined. If, for n = 3, the four 
simple points of the cubic transformation, both direct and inverse, are fixed at 


Pi, *** , Pa, then the double F-points must coincide at O,, since the five direct, 
and five inverse, F-points are projective. The generator Jo, is then the projec- 
tion of each conic of the pencil on p;, --- , ps into itself from O,. A quadratic 


involution with F-triangle at pe, p3, ps and fixed point p, converts the conics 
into lines on p,;, and the lines on O; into conics on po, ps, ps, Pi. Thus our 
generators are transformed into cubic involutions with a fixed double F-point, 
three fixed simple F-points and one variable F-point. They generate an infinite 
de Jonquiéres group with a pencil of invariant lines through p,;. 

For n = 4 the two sets of direct and inverse simple points of the de Jonquiéres 
transformation cannot lie in superposed position. 


10. Inequalities satisfied by the integral coefficients of the elements of g,(a). 
In this section we take g,(a) with 6 = 1,8 = a’ — 1. We shall be concerned 
primarily with those linear forms which are respectively the conjugates of 2, 
and the conjugates of x; , under g,(a), i.e., with the rows of the various elements 
in g,(a). It is convenient to use also the contragredient form g,(a); of g,(a)z 
which is defined by the following contragredient invariant: 


(1) foro — (f1mi + +++ + E505) = foto — (firs + --- + Et). 
Then the contragredient generators of g,(a),z and g,(a); are 

2) = at — (a* — 1), f = ab — &, 
Q) h@:;) ~ i 

t= Xo — at; i = (@ — 1) — ah. 


The invariants of the two groups are respectively: 
L(x) = 2% — (a — 1)(% +--+ + 2), 
L(é) = (a + 1)& — (i + +++ + &), 
Q(z) = x) — (a* — 1)(zi + --- + 3), . 
Q(s) = (a* — 1) — (Gi + +++ + &). 


A set of values c of the z’s will be called a characteristic C(x) (x = c), and 
this set will often be indicated by a form cof — (citi + +--+ + ¢,&); a set of 
values y of the £’s will be called a characteristic C(£) ( = y), and again this set 
will often be indicated by a form yoto — (yiti + +++ 4+ Yp%p).- 

The characteristics C(x) indicated by & and its conjugates will be called 
characteristics H(x). A list of the early conjugates follows: 


(3) 


H _ fo, 
{1 + (a — 1l)}h — &, 
= {1 + (a — 1)(a + 1)}h — af — §&;, 


= 
= us 
| ll 
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Hiss = (1 + (a — 1)a'}h — (a — ati — af;, 
Heys = (1+ (a — Ie + a4 ID} — ats — at; — &, 
(4) Hiss = (1 + (@ — Ie’ — a — a + I)}h — (a — 2a’ + 1YE, 
— (a — a)é;. 
H jxjs = {1 + (a — 1)(a® + 1) fh — (a — a + 18; — (e" — aE; — eek. 
Husjs = {1 + (a — 1)(@’ + I} i — (@ — a’ )E — ot — &, 
Hinjs = {1 + (a — 1)(a° + a’) }& — (a° — adi; — at; — aete, 
Hujs = {1 + (@ — 1)(a® + a + @ + 1)} bo — aE; — aE; — at — bh, 


2 


It is clear that 
(5) The set of conjugates (4) of & under g,(a): have for polars as to Q(x) the set 
of conjugates of x) under g,(a)z. The form H;;... is the transform of H by the 
product I,(&)-Ié)---. The coefficients ho, hi, -+-,h, of a particular form 
hofo — (hid: + --- + h,pt,) satisfy the relations Q(h) = 1, L(h) = 1. 

The characteristics C(é) indicated by x; and its conjugates will be called 
characteristics P(). A list of the early conjugates follows: 


Pi = @j, 

Pi; =% — axj, 

Py = at) — (a — 1) — azj, 

Pu; = (e° — ax — ala’ — a — 1)a; — (a — 1)%, 
Pint = a % — ala — 1)x, — (a& — 1) — aj, 


(6) Pixie = (a° — 20° + 1)x9 — (a* — 20° — a” + 2a)x. — aa” — a — 1)z,, 
Pyajt = a (a — 1)a0 — (a” — @)(a” — 1); — a(a® — @ — 1)2; — (a — 1)ax, 
Pin = (@ — a + 1) — (a — a — & + a), — a(a® — 1)2, 

— (a — 1)x, 
Pia; = (a — ax — (a — 20°); — ala’ — 1)2; — (a* — 1)ax, 


Prim = 0X9 — a (a — 1)tm — a(a® — 1), — (a — 1)rx — an;, 


Again it is clear that 

(7) The set of conjugates (6) of x; under g,(a), are such that P j, ... is the transform 
of x; by the product I;(x)-I,(x) ---. The coefficients m, m,-+- , 7, of a par- 
ticular one of these forms, moto — (1121 + --+ + 2,2,), satisfy the relations L(x) = 


1, Q(x) = —1. 








r= 


wererrTr 
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We prove first the theorem: 
(8) Any characteristic (ho; hi --- h,) for which L(h) = 1, Q(h) = 1, and for 
which h; = 0 (j = 1, --- , p), when arranged so that hi = ho 2 hz; 2--- = 
satisfies the inequalities 


0<h < (a+ I)h 


except in the one instance (1; 0 --- 0) given by H in (4). 

For, from L(h) = 1,a 2 2 and h; 2 0, it follows that ho > 0. If ho = 1, 
it follows from h; + --- + h, = 0 that h; = 0 (Gj = 1,---, ,). This yields 
the excludefl characteristic (1;0 --- 0). Ifho>1,h>0. Ifthen hp = --- = 
h, = 0, L(h) = Q(h) = 1 become (a — 1)h; = ho — 1, (a — 1)hi = WB — 1. 
By division, (a + 1)h; = ho + 1, whence (h) = (a; 10 --- 0), which satisfies 
(8). Ifh: > 0, we use the inequality hi(he + --- +h,) [AE AS + --- +h2. 
This, combined with L(h) = Q(h) = 1, yields [(ho — 1) — (a — 1I)(a + lh, = 
(hg — 1) — (a° — 1)hi. Hence (ho — 1)(@ + 1h = hi — 1. Since ho > 1, 
(a + Iki = ho + 1, or ho < (a + 1)hy. 

We prove further that 
(9) Every characteristic (h; h, --- h,) which satisfies L(h) = Q(h) = 1, and 
which satisfies the system (infinite except when p = 2, a = 2) of inequalities Pj, - - - 
(x = h) = O obtained from the forms (6), can be reduced by transformations in 
g,(a)z to the characteristic (1;0 --- 0). 

For, since (h) satisfies x; = 0, the hi, --- , A, are positive or zero, whence ho 
is positive. According to (8) there is an h; such that ho < (a + 1)h;. Then 
I(x) applied to (h) yields a characteristic (h’) for which hg < ho and which also 
satisfies the inequalities h; = 0, since these inequalities for h’ arise from in- 
equalities in the set (4) satisfied by (h). The element ho can therefore be con- 
tinually reduced until it reaches 1, in which case the remaining h’s are all zero, 
since they still satisfy x; = 0. 

It is probable that all of these inequalities are independent. For example, 
when a = 2, there is a case (h) = (11; 61°) which satisfies x; = 0 but which does 
not satisfy P; > 0. There is also a case (h) = (20; 971°) which satisfies r; = 0 
and P; = 0, but which does not satisfy Pj, 2 0. 

A list of characteristics for which L = Q = 1 and z; 2 0, for this case a = 2 
and complete up to n = 17 is as follows: 


(1; 0), (2; 1), (4; 2, 1), (5; 2°), (8; 4, 2, 1), (10; 4”, 1), (10; 5, 2°), (11; 6, 1')*, 
(13; 6, 4, 2), (13; 7, 2, 1°)*, (14; 6, 5, 2), (16; 8421), (17; 9321°)*, 
those marked * not being reducible to (1; 0 --- 0). 


(10) If a characteristic (h) = (ho; hi «++ h,) is reducible in g,(a)z, when a = 3, 
to (1; 0 --- 0), it satisfies the following system of inequalities: 


(I) ah Sho <(a+1)h; (I) bk >he > --+ > he > hen = 0 


= hy = --- =h,; 


(III) ho > (a + 1)ho; (IV) ho — ahz > hy. 
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For if (h) is reducible to (1; 0 --- 0) in g,(a),, then conversely (1; 0 --- 0) 
can be transformed into (h) by a sequence of involutions J,. Such a set of 
transforms, hofo — (hiéi + +--+ + Apé,), of (1; 0 --- 0) by sequences of four or 
fewer involutions is found in (4). The statements of (10) are satisfied by all 
of these transforms, and furthermore the largest h; (¢ = 1, --- , p) is that which 
arises from the involution last used. Suppose then that these statements re- 
main true for transforms under all sequences of n or fewer involutions. If (h,) 
is such a transform, we apply an additional involution to obtain a transform 

(h'n41) — (aho = (a° = 1)h;; ho — ah; , Ai, ss, hj, hin, ao h,), 
where j = 2 else (hj,4:) is an (h,_,). Then the inequalities (II) are satisfied by 
(h',41), since (II) and (IV) are satisfied by (h,). The second part of the in- 
equality (I) is then a consequence of (8). The first part of the inequality (I), 
hy = ahj, reduces toh; = 0. The inequality (IV) is an immediate consequence 
of L(h’) = 1 which can be written in the form 

ho — ah, = hy + 1 + [(a — 3)hy + (@ — I)(hs + +> +h) + (i — Aad]. 
Finally, the inequality (III) is a consequence of (IV) and h; < hy. 

(11) The generic conjugate of xo in g,(a) (a = 3) has the form x) = hor) — (a — 1) 
(hia, +--+ + h,x,), where the numbers ho, ---,h,, when properly ordered, 
are those described in (10). 

This brings out the rather unlooked for result that when a 2 3, none of the 
hi, --: ,h, are equal unless they are zero. Thus the matrices M, of 9 (13)(b) 
do not exist. However this is not true when a = 2, as the list preceding (10) 
shows. 

The conditions, finite in number, given in (10) do not replace the infinite 
number mentioned in (9). They include the first of these, such as x; 2 0 and 
P; = 0, but they do not lead to P, = 0 [ef. (6)]. 

We now consider the characteristics (a ; 7 --- ,) mentioned in (7), examples 
of which are given in (6). The first of these, (0; —1, 0, --- , 0), is somewhat 
exceptional in that m = 0 and a 7; is negative. This is the only solution of the 
equations L(r) = 1, Q(x) = —1 for which m = 0. It is also clear that the 
second example, (1;a 0°"), is the only solution of these equations for which 
™,°**,7, are zero. We now prove that 
(12) Except for the two particular cases just mentioned, every characteristic 
which satisfies the equations L(r) = 1, Q(r) = —1, and for which r; = 0 (¢ = 
1, --- , p), when arranged so that m, 2 m2 2 --- = m,, satisfies the inequalities: 


(a — 1)m < m < am; tm, < (a — 1l)mifa 23. 


For from the inequality m(m2 + ---+2,) 2 mateo + r,, it follows immedi- 
ately from L(r) = +1 and Q(x) = —1 that mom(a +1) = (a@ — 1)m5 + 71 +1, 
or mom (a+ 1) > (a? — 1)", 0rm > (a—1)m. Let then m = (a—1)m+hk 
(k > 0). Again from Q(r) = —1 we find that 2(@ — 1)m(m —k) =k - 1 
+s + +++ + rr. Since k > 0 and we > 0, then k < am, and m < am. 
If now also m2 = (a — 1)m + k,, then 
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—(a — 1)\(a — 3)m — 2(a@ — 1)mok — 2a — 1)mki = 
RP-lLtkita+--. +a. 


The right member is positive or zero. On the left the first term is zero or nega- 
tive if a = 3, the second is negative, whence k, < 0, or m. < (a — 1)m. 
(13) With the exception mentioned in (12) every characteristic x, for which 
(moto — m2, +°-+ + @,2,) is a conjugate of an x; under g,(a), satisfies, when 
a = 3, not only the equalities of (12) but also the additional inequalities: 


(a* — 1) > m+ am; ™m > > +s > te > We = O = wee = +++ = Tp. 


It is clear that this is true of the members in the list (6) and that further the 
largest integer 7, of the characteristic arises from the involution last applied. 
Let us then assume that (13) is true for all cases (7,) which arise from zx; by 
applying n or fewer involutions. If then we apply an additional involution 
(which is not J, , since the transform would then be a (,—,) for which (13) is 
true), the transformed characteristic is 


(x’) = (am) — 4; ; (a@ — 1) — an;, MWiy *** » Wj-1y Wjzr,*** » Tp) (j Pa 2). 
If (a° — 1)m — ax; > m, ie., if the first inequality is satisfied by (7), then the 
remaining inequalities are satisfied by x’. But this first inequality is a conse- 
quence of L(x) = 1,a 2 3, and 7; 20. For (a = 1)L(r) = (@ — 1) can be 
written as 
(a® — 1) — ame — m = (a + 1) + [(a — 3)m + (m, — m) + (a2 — 1) 

(m3 + --- + ,)]. 
On the right a — 1 > O, and every other term is zero or positive, whence the 
first inequality is satisfied by x and also by (z’). 

Some of the inequalities of (10) and (13) fail in the case of a = 2 because of 
the presence of periodic elements other than the generators. 

If a + 2, we have the theorem 
(14) The groups g,(a), a = 3 contain no periodic elements other than the involu- 
torial generators. 

For if T = I,,Jk, «++ Jk, is an element of g,(a) naturally so written that no 
two successive subscripts k are alike, then it may happen that k, = ki. Then 
T is the transform of T’ = J;, --- Ii,., by I; and T’ has the same period as 7. 
The determination of the period may again be simplified if ke = k,.,. Pro- 
ceeding in this way, we must eventually find that T' is the transform of a genera- 
tor of period two or that T is the transform of an element T” = I;,,, --- Zi, , 
where 1 + s # n — 8. Then (7”)’ is a product of j(n — 2s) generators such 
that no two successive factors are alike. But it is clear from (10) (III) that, 
as these successive factors are adjoined, the values of Ao in the characteristic 
xo = hoto — (a° — 1)(hiz; +--+ +h,2,) steadily increase so that (7”’)’ can 
never be the identity. 
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COMMUTATOR ALGEBRA OF A FINITE GROUP OF COLLINEATIONS 


By HERMANN WEYL 


1. Introduction. In Chapter V, §§2—4 of my book The Theory of Groups and 
Quantum Mechanics (English edition, London, 1931) I gave an elementary ac- 
count of the decomposition of tensor space into subspaces invariant under the 
algebra of “symmetric” transformations. The treatment was based upon the 
reciprocity between the ring of a finite group y whose elements s induce certain 
linear operators s in a given vector space ® and the algebra % of those trans- 
formations A in ® that commute with all operators s. It was immaterial that 
® was the manifold of all tensors of a certain rank f in an underlying vector space 
and y the symmetric group of all f! permutations operating on the f indices or 
arguments of the tensors. In many respects the group ring stands in a simpler 
relationship to this commutator algebra % than to the enveloping algebra B 
of the operators s, and therefore it seems desirable to discuss both sides in a 
direct way rather than to rely upon the general theory of a matric algebra and 
its commutator algebra (cf. in this regard the observations in the concluding 
section). 

A number field k may be given in which all numbers which occur are supposed 
to lie. We consider vectors 


f= (fi, +++ fn) 
in an n-space §% whose components f; are numbers in k. When the result of a 
linear transformation U = ||ujx|| on f is denoted by Uf, the components of f 


are to be written in a column. A collineation f — f in the projective (n — 1)- 
space based upon the number field k is a linear transformation U of the co- 
érdinates f; combined with an automorphism s: a — a’ of k: 


n 


fi = > Uns or f = Uf". 


k=1 
Representations of a finite group by operators of this generalized type involving 
automorphisms of the reference field were studied with remarkable success in a 
recent paper by Messrs. I. Nakayama and K. Shoda (Jap. Jour. Math., vol. 12 
(1936), pp. 109-122). The same step will here be carried out with respect to the 
commutator algebra. 


2. The group ring. The situation we are concerned with may be described 
thus. Given a finite group y of order h; if k is of prime characteristic, that prime 
shall not be a divisor of h. To each element s of y corresponds an automorphism 
a — a’ of k such that 

(a’)' _ oa. 
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A collinear representation of degree n of y associates with each s an operator 
f —f in the n-space ® of the form 


(2.1) f= Liualsdfi (é, & = 1, --- mn) 


in such a way that composition of collineations reflects the composition of group 
elements. This is expressed by the following equation for the matrix U(s) = 
|| wee (s) |! : 


(2.2) U(st) = U(s)U°® (s, tin y). 
In particular, 
(2.3) U(s)U(s"') = E- or Us") = U“(s). 
(2.1) shall be indicated briefly by 
(2.4) f=sf. 
We introduce the “quantities” of the group ring p by the formal sum 
a= > a(s)-s 


extending over all elements s of y; the components a(s) are arbitrary numbers 
in k. Such quantities are at first abstract elements forming a linear manifold 
(or “vector space’’) r of h dimensions. At the same time they serve as operators’ 
in R: 


(2.5) af = Lals)-sf. 

This “realization” suggests how to perform multiplication: one has 
ai(a: f) = af 

if a = a,a be defined by 

(2.6) a(s) = 2, ar(t)ax(t’). 


The multiplication is associative: 
(a,a2)as = @(@28s). 
Indeed, the left side equals 
Pe a(t’) a;‘ (t”), 
whereas the right side equals 


do ar(?)(a(t)as'(”))*. 


tte’t’? 


1 These operators are not collineations in the same sense as the operators f — s/f. 
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Not even the multiplication with multiples 81 of the unit element 1 of y (8 in k) 
is commutative here: while 8a is to be interpreted as the quantity with the 
components Ba(s), a8 has the components a(s)@’. 

In this section we concentrate on the abstract group ring p and shove its repre- 
sentation by operators (2.5) into the background after it has served its purpose 
of suggesting the law of multiplication (2.6). 

A part p of r closed with respect to addition and the operation 


x— x’ = ax (x — x’ = xa) 
for any a in pis called a left (right) invariant subspace of tr. is a linear subspace 
in the sense that it contains 
x + y, ax (x + y, xa) 
along with x, y whatever the number a ink. Associating 
(2.7) (a): x’ = ax 
with the quantity a gives rise to the regular representation (p) of p whose space 
is r itself: 
a(bx) = (ab)x. 

THEOREM 2A. A left invariant subspace p possesses an idempotent generator 
e (to the right), i.e., xe lies in p for every x, and xe = x for everyxinyp. The same 
is true for a right invariant subspace, but one must then write ex instead of xe. 

The theorem implies that e = 1e is in p and hence ee = e. 

Its proof is almost the same as given in my book, l.c., on pp. 291-292 for the 


customary. group ring. For a left invariant subspace p it runs as follows. 
We construct a projection x — x of r onto , i.e., a substitution 


(2.8) as) = > d(s, dx(t) 


t 


with the two desired properties that it changes every x into an x in p and is the 
identity within py. From the expression 


y(s) = Dea(r)a'(r*s) for y = ax 


one concludes that the substitution leading from 


y(s) = 2'(r's) to 9(s) = F(r's) 
is a projection as well: 


j(s) = DD d'(r‘s, r*t)y(t). 


t 


Hence the same holds for the “average”: 


e(s, t) = : > d(r's, rt). 





ate > 
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As it satisfies 
e(r's, rt) = e(s, 0), 
we may write 
e(s, t) = e(t”’s) with e(s) = e(s, 1), 
and then the projection 


as) = >> e(s, é)x(t) 


can be abbreviated to ¥ = xe. 
For a right invariant subspace a projection will be of the form 


#s) = Do d(s, tx" * (t) 


rather than (2.8), because this kind of substitution changes xa into xa along 
with x — x, and the averaging process is to be defined by 


e(s, t) = ; > d(sr, tr) = e(st™). 


Incidentally right- can be reduced to left-invariance by a simple process ex- 
changing the order of factors. With a quantity a we associate 4 as defined by 


(2.9) a(s) = 4'(s’). 

The relationship is involutorial because (2.9) entails 
a(s) = a'(s”’). 

The reader is called upon to verify the 

Lemma 2B. Ifa = ajao, then& = &4,. 

From now on only left invariant subspaces will be considered, and the specifica- 
tion “left”’ will be dropped. We derive from the existence of the generating 
idempotent these consequences (loc. cit.): 

THEOREM 2C. An invariant subspace » containing the invariant subspace 
pi < pcan be split according to p = p; + pz into p, and a complementary invariant 
subspace pe . 

Proof. e, being an idempotent generator of p,, we have 

x = xe, + (x — xe;)) = X1 + & 


for every element x of p. The first summand x; is in p,, while the second satisfies 


xe, = 0. Hence pe = {xe} is linearly independent of p, . 
A given idempotent generator e of p splits like every x of p into two parts 
(2.10) e=e, + e@ 


lying in pi, p2 respectively. As (2.10) implies the following decomposition of 
any X in p: 


xX = xe = xe, + Xe. =X, +k, 
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we have in particular 


e,e; = €, e,e = 0, 


e.e, = 0, @2@o = Cc. 

TuHEeorREM 2D. A similarity projection x — x’ of an invariant subspace p upon 
p’ is generated by aft multiplication with a quantity b:x’ = xb. 

A correspondence x — x’ is a similarity projection with respect to the algebra 
(p) of the operators (a), (2.7), if carrying x + y into x’ + y’ (linearity) and ax 
into ax’. The proposition follows at once if b is taken as the image e’ of the 
idempotent generator e of p; one then has eb = b. 

Invariant subspaces which can be put into a one-to-one similarity correspond- 
ence are called similar or equivalent. 


3. Formal lemmas. We now return to the representations (2.4), (2.5) of 
y and p by operators in R. Notice that 
s(f+f) = sf+ sf’, s(af) = a’-sf. 
For any value i = 1, --- ,» of the index 7 we denote by f,; the quantity in p 
with the components 


fils) = of; = x Uin(s)fi 


and by f the column (f, , --- ,f,) whose s-component is the vector f(s) = sf. 
The arguments used (l.c., §4) rest on the validity of the following formal lemmas. 
Lema 3A. 


(3.1) a-f; = D> anf; 
k 
with 
asx || = Da(r)U"(r). 
Proof. The s-component of a-f is the vector 
g(s) = 2 alr)(r sf)’. 
Apply 
f=rt'f) = UME, (rf) = U(r) 


to sf rather than f and thus verify the statement of the lemma. 
Lemma 3B. f;-a = g; where the vector g is defined by 


g = da (r')-rf = af. 


In other words, if g = af, then g = f-a. 
Proof. f;:a = x is indeed given by 


x(s) = > srf;-a"(r’) = Sqi. 


r 
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Lemma 3C. An equation of the kind 


(3.2) as) = Di girsh = eV(sf*, 
where g = (g1, -** , ¢@n) is a row rather than a column of numbers (contravariant 


vector) entails 


ae) @ ¥ fran 


i=1 


when this is interpreted as meaning 
(3.3) a(s) = ¢'U“(s)f. 
The linear transformation 

lai || = || X sfi-serl, 
that is 
(3.4) A= > U(s)f' ¢' U-(s) 


commutes with the operators s. 
Proof. (3.3) follows at once from (3.2) by taking (2.3) into account. g being 
an arbitrary vector in ®, we compute from the explicit expression (3.4): 


tAt’g = UW®A(t"g)' = U®A'U"(Hg, 
U®A'U"() = > U(ts)f" o" U"(ts) = A. 


4. Reciprocity between group ring and commutator algebra. The object 


of our investigation is the ring % of all linear transformations A = || aix || : 
(4.1) fi = LD Gufs 
k=l 


in R which commute with the operators s induced in ® by the elements s of y. 
For each A in &, (4.1) thus implies 


(4.2) sf; = Xe ain She or f; = Di aiefs. 


For each A in & the multiple aA lies in & provided the number a is self-con- 
jugate: a* = a@ for all group elements s. Hence & is an algebra in the subfield 
of self-conjugate numbers rather than in k itself; nevertheless we venture to 
speak of % as the commutator algebra. 

The complete reciprocity between r under the influence of (p) and ® under 
the influence of 2% can now be established in the same manner as l.c. §4. Terms 
like “invariant”, “irreducible”, “similar” or “equivalent”, when applied to r 
or ® refer to the algebra () of operators x — ax or to the algebra % respectively. 
The term “linear subspace”’ is in r to be interpreted as demanding closure with 
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respect to addition and the ordinary multiplication x — ax (not the modified 
aft multiplication x — xa) by numbers ain k. p being such a linear subspace of 
r we introduce the corresponding subspace 2 = #p of MR as the set to which a 
vector f belongs if f; is in p fori = 1,---,n. (4.2) shows at once that § is in- 
variant. Vice versa, if $ is a given linear subspace of Rt, we define p = 49 as 
the linear closure of all the quantities f; (i = 1, --- , n) arising from vectors f in $. 
If 


oP an SE". +++ (a = 1,2, ---,m) 
is a linear basis of the m-dimensional vector space $, then 4 consists of all 
quantities x of the form 
r= Dol?” = DV @e’t™) —(¢§* arbitrary). 


In particular we set 
bR = tH. 


According to Lemma 3A, 4$ is an invariant subspace of t. Moreover, by 
definition, 


aD <p, B< #5S. 


Inclusion can here be replaced by equality: * and §& are inverse operations pro- 
vided we limit ourselves within R to the invariant subspaces $-and within rt to the 
invariant subspaces » of to. Besides, those operations are conservative as to 
reduction, decomposition and equivalence. We exhibit these facts in two 
theorems: 

TuHeoreM 4A. If p(p’, pi, peo) are any invariant subspaces of t%) and $ = Xp, 
then 


p’ <p, p= pit Pe, Di ~ De 
imply 
yp’ < §, P= B+ f, Zi ~ PB 
respectively, while conversely, 
p= 5. 
TuroreM 4B. Jf $ (B’, B., Bo) are any invariant subspaces of Rand p = 4B, — 
then 
B= #p 
and 
¥’ < , $= Ft+ %, Bi ~ Pe 
imply 
p’ <p, p= pi + pe, Pi ~ Pe 


respectively. 
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Before proceeding to the almost literal repetition of the proofs, we make this 
remark. If é is the idempotent generator of an invariant p, then the correspond- 
ing $ = #p consists of all vectors of the form ef. Indeed, g = ef lies in $ 
because g; = f;€ by Lemma 3B, and for each f in $ one has ef = f. 

1. We prove the first part of Theorem 4A by observing that the decomposition 
~) = pi + Pe when applied to an idempotent generator é of p: 6 = €6, + @ leads 
to this decomposition 8 = B, + PB. of B = Kp: 


F=eF =e F +eF = F, + Fo (F in §). 
Lemma 2B allows us to shear all the roofs off the relations 


* 


é:éo = &é; = 0 (4.4: = a, &2é = és), 
thus warranting the independence of the parts 2, , Be : 
ef; = 0, eF, = 0. 


2. The similarity correspondence between p; and pe, 


X. = Xb, x, = xb’, 
gives rise to the mutually inverse transformations 
fr = bhi, fi=d'fr 


between the vectors f; , fo of $1 = *p, and PB. = pe. By (4.2) these formulas 
establish a similarity correspondence: 


fi = Af; entails bf, = A(bfi) {A in %}. 


To secure the last part, p < 43, we construct 4 by means of the idempotent 
generator @ of p as follows: if g“ (@ = 1, ---, m) ranges over a basis of the 
complete vector space 9, all f‘” = eg‘ lie in 8 = #p, and hence 


y= Dei fh” = > (oe #) 


in 8%. On introducing 
_—— Do (yg g), 


we have y = xé. So xé lies in 49 if x lies in m = 49. But each x in p satisfies 
both conditions: x in t) and xé = x. 

The converse Theorem 4B exhibits the really important facts. Its assertion 
that p = &B implies $2 = #p for any subspace ¥$ invariant under Y% is the back- 
bone of the whole theory. Let é be the idempotent generator of p = 5%. 
Like all elements of p it is of the form 

&s) = Lgi®-sfi”’, 


where 


s” = if. vee, ce) 
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ranges over a basis of $. Hence by Lemma 3C, 
e(s) = > sei” fi, 
and any vector g = ef of #p is given by 
(4.3) n= L(LaPp) = Lai”, 
where 
air = Li sfi-sei”’. 


(a) 


Each term g‘ of the sum (4.3), g = 2g‘, arises from f‘” by a linear trans- 
formation A‘ = || a{f? || which according to the same lemma commutes with 
all s. Hence §, being invariant with respect to the transformations A of the 
commutator algebra %, contains g“ as well as f‘”. This proves our statement: 
gin Sor #p < §. 

The decomposition 2 = ¥, + ff: implies by definition that each quantity 
xin p = 4 can be written as a sum x, + Xe, X, in p, = 491, Xe in po = 4Pe. 
It remains to prove that p, and jp: are linearly independent or that the inter- 
section p* = p,; ° pe is empty provided 2* = [,° L. be empty. But according 
to the part of Theorem 4B already proved, 


#p* < Kp, = £1, *p* < P ; 


hence *p* < $* and by the last part of Theorem 4A: p* < §f*. 

The transition from 2, ~ $2 to p: ~ pe for p: = 4P1, po = &Pe, is to be based 
on the following statement, the proof of which is contained in Lemma 3B: 

Lemma 4C. 1 is right as well as left invariant. 

Therefore ro has an idempotent generator i to the left: i in m, ix = x for 
every X in ft. 

Let f be a basis for B, and let the given similar mapping of 2, on B®. send 


f‘® into g. When we put 


bo ¢” f;°, 


a,i 


(a) (a) 
y= Dd ¢S" ‘gi; 


x 


(¢;*’ arbitrary numbers) 


the correspondence x — y between an x and a y with the same coefficients ¢;*’ 
will establish a similarity mapping of p, = 4B on pe = 4P2, because by Lemma 
3A, we obtain 


ax = > wo A, ay = a vi gi” 


with 


( 
we = ym gi” “Qik. 
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This definition of x — y, however, goes through only if x = 0 implies y = 0. 
We first prove that 


xf = 0 implies yf = 0 


(for any vector f). By Lemma 3C the vector F = xf is a sum of terms F'”’, 
the a-th of which arises from f‘” by the transformation 


A® = |\ast || = | 2 sfi-sei” |. 


Since A‘ is in M, the given similarity mapping of B, on P. sends F‘® into the 
corresponding part G‘” of G = yf and hence F intoG. Therefore F = 0 implies 
G = 0, and more especially, when the numbers ¢§*? satisfy the equation x = 0, 
we must have yf = 0 for every vector f, or by Lemma 3B, f;-y = 0. Hence 
the given quantity y satisfies zy = 0 for every z in rt, in particular for z = i. 
But as y itself lies in tp , the ensuing equation iy = 0 yields the desired result: 
y = 0. 

The complete reciprocity established by Theorems 4A and B involves the fact 
that the process * not only changes p = 0 into *p = 0 and a part p’ < p into 
apart *p’ < *p, but alsoap + Ointoa #p ¥ 0 and a proper part into a proper 
part provided the p’s are invariant subspaces of t.. The decomposition of to into 
irreducibly invariant subspaces » leads to a decomposition of R into subspaces 
irreducibly invariant under the algebra XU, and both decompositions run absolutely 
parallel even as to the pertaining equivalences. 


5. Representations of the group ring. The roof operation. Not so simple 
is the relationship of the abstract group ring of our quantities a to its represen- 
tation by the operators 

jaf 


in R which form a homomorphic operator algebra B, although % also breaks up 
into irreducible parts similar to the irreducible parts of p. For a given vector f 
and a given invariant subspace p of r we denote by p(f) the set of vectors xf 
arising from f by allthexinp. Let f‘” (a = 1, --- , n) bea basis for the n-space 
®R and 


(5.1) r=ptpet::: 


a decomposition of r into subspaces irreducibly invariant under (p). Our 
statement is proved in the well-known manner by going through the list 


p(f), on . n(f), 
p(f"’), — p(f”’), 


and dropping a term each time it is contained in the sum of the preceding ones. 
The construction depends on the choice of the codrdinate system f‘” as well as 
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the decomposition (5.1), and does not result in such a thoroughgoing parallelism 
as we encountered for the commutator group. Coincidence between the num- 
bers of equivalent parts is not to be expected. Whereas for the commutator 
algebra it was essential to restrict onesélf to the two-sided invariant subspace 
t, t is here to be taken modulo that two-sided invariant subspace whose ele- 
ments a satisfy the equation af = 0 identically in the vector f. 

Considering all these circumstances, it seems to me inadequate to avail one- 
self of the reciprocity’ between a matric algebra $8 and its commutator algebra 
% for getting a hold on A through %, although the most essential point, the full 
reducibility of %, can be reached by this method applicable to any abstract 
semi-simple algebra p. One could visualize the situation as follows. One 
decomposes the “one” 1 of p into independent primitive idempotents: 


1=@+&+-:::; 


e,e, = e; or 0 according asi = kori #k. (An idempotent e is primitive if not 
allowing of a decomposition e; + e2 into two independent idempotents except 
the trivial ones e + Oand0 +e.) It can be shown then that 


x= xe, +xe+---andf=eft+ef+-:: 
result in a decomposition of r and % respectively, 
t=) 48? *-*, R= Pt Bh+---, 


into irreducibly invariant subspaces. Again, f — af is a given representation 
of the elements a of p by operators in 8, and invariance in r refers to the opera- 
tions x — ax in ® to the algebra % of linear transformations commuting with 
the operations f — af. However, the correspondence thus established between 
the parts p; of r and 8; of R depends on the choice of the idempotent generators 
e; of p;. To make the correspondence independent, one must match the sub- 
space $ of the vectors ef against the subspace p of the quantities xé rather than 
xe. So one may say that the more elementary and complete reciprocity we 
expatiated on in the preceding sections is due to the existence of the operation * 
in a group ring while missing in an arbitrary abstract algebra. The rdle of * is 
further clarified by the following.’ 

TueoreM 5A. If the invariant subspaces »,, po generated by the idempotents 
€1, €2 are equivalent to each other, so are the invariant subspaces , , pe generated by 
é:, é. p and ) are the substrata of contragredient representations. 

Let the one-to-one similarity mapping 2; — 22 of p; on pe carry e; into b and 
in the inverse direction ¢, into a. We then have 


(5.2) X2 = 2b, XY, = 2a. 


2 See, for instance, Weyl, Ann. Math., vol. 37 (1936), p. 718, Th. (1.4-B). Even the above 
statement should be made with reservation, since 8 is not a matric algebra, and A not a 
matric algebra in the strict sense. 

§’ Compare (l.c.) pp. 352-354. 
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b satisfies e;b = b, and like every other element of pe , bee = b. Hence 
(5.3) €, bez = b, eae; = a. 


Moreover, if we put 2; = a in the first and z. = b in the second of the equations 
(5.2), we obtain 


(5.4) €2 = ab, €, = ba. 


Conversely the relations (5.4) guarantee that (5.2) are reciprocal mappings 
Yi = pe. We need only to “roof” these equations (5.3), (5.4) in order to con- 
clude that é, , & are linked by 6, 4 in the same fashion as ¢; , ez by a, b. 

To prove the second part we have to introduce the notion of trace: the trace 
of a quantity a, tr(a), is its unit component a(1). The trace tr(zy) of the product 
of two variable quantities x, y is bilinear in the sense that it satisfies the dis- 
tributive law with respect to decomposition z = x, + 22 of the first as well as 
the second factor, and that it takes on the numerical factor a@ if one replaces 
x by az or y by ya (distinguish between fore and aft multiplication). Moreover, 


(5.5) tr(zy) = Dd 2(s)y*(s") 


is a non-degenerate bilinear form: each of the equations tr(az) = 0 or tr(za) = 0 
when holding identically in z leads toa = 0. (5.5) is not symmetric in x and y 
as is the case for the ordinary group ring. However, the obvious equation 
tr(é@) = tr(a), together with Lemma 2B, establish the following modified law 
of symmetry: 

tr(yx) = tr(#). 
One readily verifies 
(5.6) tr(srs*) = tr*(z). 


e being a given idempotent, let p and q be the left and the right invariant sub- 
spaces consisting of the quantities ze and ex respectively. We assert that tr(zry) 
is non-degenerate if x and y vary in p and q respectively. Indeed, if z is any 
element whatever and a in p, then 


az = ae-z = a-ez = ay, 

where y = ezising. Hence the assumption tr(ay) = 0 for y in q implies tr(az) = 0 
for all z, whence a = 0. Similarly for the second factor. We now refer p 
and q each to a coérdinate system a; and b; such that 


(5.7) r= a +--+ + &a,, y=bm+--- +hm 


describe p and q if the numbers £ and 7 vary freely ink. From the non-degener- 
acy of tr(ry) for (5.7) follows readily the coincidence h = g of the dimensions of 
p and q and the possibility of adapting the coérdinate system b, in q to the arbi- 
trarily chosen coérdinate system a; in p such that 


tr(zy) = im +--+ + 50% 
for x in p and y in q. 
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We now consider the simultaneous substitutions 
(5.8) x’ = sz, y’ = ys" 
in p and q respectively. The y-substitution may also be put into the form 
(5.9) y = sy, 
where 
= mb. +--- + mb, 


now varies in the left invariant subspace p generated by é@. We write (5.8), (5.9) 
in terms of the codrdinates as 


f= Liuals\e, mi = Li vals) at. 
The relation (5.6) yielding 
tr(z’y’) = tr'(zy) or Li kin = Le kini 
proves the two matrices 


|| wix(s) ||, || vse(s) || 


to be contragredient. Hence the regular representation of p induces in p and p 
contragredient representations of y (assuming coérdinate systems in p and p 
properly adapted to each other). Observe that passage to the contragredient 
matrices 0(s) in a collinear representation s — U(s) produces a representation 
again, as is readily seen from equation (2.2). 
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TAYLOR’S SERIES OF ENTIRE FUNCTIONS OF SMOOTH GROWTH 
By N. WIENER AND W. T. MartTIN 


1. Introduction.’ Let a, = 0 (nm = 1, 2, --- ), and 


(1) © aya" ~ sH(2) (x @). 
We may rewrite (1) in the form 

(2) Lae? 4 (=), 
where 

(3) H(z) = e"°**, 


We shall derive the following Tauberian theorem. 
TueoreM 1. Leta, = 0 (n = 1, 2, --- ), and let (2) hold, where 


(4) F is four times continuously differentiable for a < x for some a; 


(5) F’'(x) 2 const. > 0 fora S z; 

(6) F’'"(z) = o((F’"(z)}) (rx »); 
(7) F(z) = o({F"(z)}’) (x ~). 
Then for any positive i, 

(8) lim (2x) aor = 8, 


zo AN rs¥(n)setr 
where ¥(x) is defined by 


(9) I “[@'(a)Pdz = (2), 


and G(x) is the inverse function to F(x) fora S x. 

As a converse to this theorem we shall prove 

THEorEM 2. Let a, = 0 (n = 1, 2, --- ), and let (8) hold for every positive 
value of 4, where F fulfills the conditions (4), (5), (6) and (7) and y and G are 
defined as in Theorem 1. Then (2) holds. 


Received January 19, 1937. 

1 The authors’ attention was directed to problems of this type by Professor Vijayaragha- 
van. It has come to our attention that a similar group of problems has been recently 
attacked by Mr. Kales of Brown University by quite different methods. While neither 
direction of work is reducible to the other, Mr. Kales’ priority in entering the field is clear. 
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In the next theorem we place some additional restrictions on F and we obtain 
certain inequalities related to y. 
THeoreM 3. (i) Let F fulfill the conditions (4) and (5) and the further condition 


(10) for some positive «, F’(x) = O({F’(z)]'**) (x — ow). 
Then for any positive e, 

(11) n** <w(n) 

and 

(12) n** < ¥(n), 


for sufficiently large values of n. (ii) Let F fulfill the conditions (4) and (5) and 
the additional condition 


(13) F’(x) is ultimately greater than any power of zx. 
Then for any positive «, 
(14) v(n) < ni** 


for sufficiently large values of n. 

We shall obtain the following theorems on entire functions of smooth growth 
as immediate consequences of these Tauberian theorems. 

THEOREM 4. Let 


(15) f(x) = > b,2" 
0 
be an integral function, and let 
2r 

(16) a | f(re™) [Pd ~ se" 198” ean 

2x Jo 
where F fulfills the conditions (4), (5), (6) and (7). Then for any positive i, 

} 
(17) lim (27) > | Dn 1. aitaiiaeee = s, 
20 nN zs¥(n) srt 


where y and G are defined as in Theorem 1. 

The converse to Theorem 4 is 

TuHeoreM 5. Let (15) be an entire function and let (17) hold for every positive 
value of X, where F fulfills the conditions (4), (5), (6) and (7), and G and y are 
defined as in Theorem 1. Then (16) holds. 

If we place additional conditions on F, we have the following gap theorem. 

TuHeoreM 6. Let (15) be an entire function, and let (16) hold, where s ¥ 0 and 
F satisfies the conditions (4), (5), (6), (7), (10) and (13). Then (17) holds where 
vy and G are defined as in Theorem 1. Furthermore, for any positive € we shall 
always have (11) and 


(18) ni* < y(n) < ni* 
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for sufficiently large values of n. Thus the function (15) can have only a finite 
number of gaps of magnitude (v, v + v'**), that is, for any positive «, the equations 
GG, = Gant = °° = On4[nit+g = 0 
can hold for at most a finite number of values of n. 
As a consequence of Theorems 1 and 2 we shall derive the following non-linear 
Tauberian theorem. 
TurorEM 7. Leta, = 0(n = 1, 2, --- ), and let 


(19) Cn = Te. An—m (n = 1, 2, se ). 
1 


If F fulfills the conditions (4), (5), (6) and (7) and if ¥ and G are defined as in 
Theorem 1, then the following two statements are equivalent: (i) the relation (8) 
holds for every positive value of \; (ii) the relation 


(20) ton (2x)! > cp 6° 7 in) -28 (a an)) - s 


ou rsV20(in) Sztd 
holds for every positive value of X. 
For the proof of Theorem 1 we shall put 


(21) — = g(u + ¥(n)), 
so that uw occurs as the difference of two variables 
(22) u=¢ (t) — ¥(n). 


Here ¢ and y are functions to be determined with regard to the following con- 
siderations. We shall develop the exponent nf — F(é) in a Taylor’s series with 
remainder and we shall determine ¢ and y in such a manner that the coefficient 
of u shall vanish and that the coefficient of }u’ shall be negative unity. Then we 
shall use the conditions on F to show that the remainder in the Taylor’s series 
approaches zero as § — ~; indeed, that it approaches zero in such a manner 
that (2) shall imply that 


(23) ; a» eX (n)—F (a(n) 5—tu? 8 ( _ oo), 
1 


We shall then show that the General Tauberian Theorem of Wiener’ is applicable 
to (23). Here the kernel K(u) is e“*’ whose Fourier transform e™ is non- 
vanishing, and, as we have noted, u occurs as the difference of two variables. 
The relation (8) will follow immediately. 

Theorem 2, the converse of Theorem 1, will follow by an application of the 
converse portion of the General Tauberian Theorem just referred to. The other 
theorems will follow at once. 


2. Let us put 
(21) E = g(u + V(n)). 
?N. Wiener, Tauberian theorems, Annals of Mathematics, (2), vol. 33 (1932), pp. 1-100. 
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We have, formally, on holding n fast and developing in powers of u, 
(24) nt — P(E) = ne(u + ¥(n)) — F(e(u + ¥(n))) 

= no(¥(n)) — F(e(y(n))) 

+ uln — F’(e¥(n)))le’m)) 


+ 3 {[n — F’(e((n)))]e"W(n)) — F’"@W™))le'Wm))F} 


+ @D: [np(v) — F(e(v))]omou+vin » 


where 0 S 6 = @(u,n) £1. 

We desire that the coefficient of u shall be zero, and the coefficient of }u’ 
negative unity, whether or not n is an integer. Necessary conditions for this 
are, formally, that 


(25) F’(e(V(w))) = w, 

(26) ¢'(¥(w)) = ¥'(w), 

for, as regards the latter, the coefficient of }u’ is 

(27) {Du[we’(¥(w)) — F’(ew)))e’H(w))] — ew) }/¥'(w). 
Denoting by G(w) the inverse function to F’(w), we see that (25) and (26) 
become 

(28) e(¥(w)) = G(w), 

(29) [v'(w)}’ = G'(w). 


For our functions ¢, ¥ we are thus led to take 
(30) Yow) = [1G OHd, elu) = Gu), 


where a is so large that F is four times continuously differentiable for w = a 
and 0 < const. < F’(w). With y¥, ¢ so defined, (24), with w in place of n, 
is valid for w and u + y(w) sufficiently large and (25), (26), (28) and (29) are 
valid for w 2 a. 

The last term of (24) may be written 


fwe’”"(@u + ¥(w)) — F’"(e(du + ¥(w)))[e’(u + ¥(w))) 
— 3F’"'(e(du + Y(w)))¢’(du + (w))e"" (Ou + ¥(w)) 
— F’(y(du + ¥w)))e"’(6u + ¥(w))}. 


uw 
(31) 6 
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Differentiating (25) and using (26), we have 


(32) F" (ew) lew) F = 1. 

For a general argument z, (32) becomes 

(33) F’(v(2))le’@)F = 1, 

and yields on differentiation 

(34) F'"'(e@))le’@)° + 2F'"(e@))e’@e"(2) = 0. 


Thus if du + ¥(w) = z, (31) becomes 


= (we'"(2) — F"(o(e))e'e)e"(e) — F(e(e))e""(2)} 


(35) 





7 (Pov) — F(o(z))e’"(z) — ce. 


¢ (2) 
Now, by the law of the mean, there will be some v in the interval between ¥(w) 
and z such that 


(36) Pew) — F(@l2)) = —buF (e(v))e"v) = —F 


Expression (35) thus becomes 


7 _w fee) oo 
wn) 6 {0 vw) + e@))” 





and 





ory _ “a wv e_ wT, M2), oe) ]) 
(38) e*€?® = exp {rtw) Flo(Yw))) — ju’ — & [eu Fo + ae |. 


We shall now prove two facts relating to the function 


(39) A(u, ) = wt — F(t) — wG(w) + F(G(w)), 
where 
(40) £ = o(u + ¥(w)). 
(A) For any positive U 
(41) max |A(u,£) + 3u°| +0 (—— «); 
jutsu 


(B) there are constants wo and & such that, if 


(42) 2<|ul, u<¢'(&) — vw), fo = &, 
then 


(43) A(u, ) S — |u|/2. 
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For the proof of (A) let us differentiate 











(44) (Wz) = [@"(2)] 
to obtain 
(45) e" (W(2))'(z) = HG’@)T'@"(a). 
Thus 
' 1" _ G(x), 
(46) e”(Y(x)) = 2G(z)’ 
- 7 , a G’(x)G’""(z) - (G’"(z)P, 
(47) e'"(V(z))¥'(x) = 1G’ @P ; 
. e' (W(x) a Ga), 
i eWa)) ~ Aah 
ie ela) _ G(2)G""(2) — [6"@)F 
¢ (¥(z)) 2(G' (x) 
Translated into terms of G, (6) and (7) become 
(50) G(x) = o({@’(x)]') —" 
(51) G(x) = o([@’(z)/). 
In view of (50) and (51) it follows from (48) and (49) that 
, e’"(x) a 
(52) (a) 0 (x ) 
and 
(53) o""(z) — 0 (x1 &), 
¢’ (x) 
Now if |z — v| < const., then 
‘ ¢’(z) _— 
(54) v0) (z ) 
since 
55 og 2 2) _ [ (2) a, a 
(55) l ei) =) 7a dz — 0 (z ). 


The relations (52), (53) and (54) yield the result (A). 
For the proof* of (B) let us differentiate A(u, £) partially with respect to u, 


dA _ dA G(w) — E _ o(¥(w)) — elu + (w)) 


ise) Ow) ow) ew) 








— ay, Ou + Ww) 


¢ (¥(w)) 


3 This proof has been revised in accordance with a suggestion of J. J. Gergen. 
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Now by (54), 


(57) e. —u (—E— w) 
du 

uniformly for |u| < 2. Next let & be determined so that 

(58) -u-$s 4s -u4h; Ads -WH+1, 


for |u| S 2, & < € and let wo be so determined that 
(59) | G’(w) | = [G’(w)} 


for ¥(w) = ¥(wo). We shall now show that (B) holds with these values & , wo . 
In fact, let & < £ and let @ be any value of u such that 


(60) a <¢ (t) — (uw) 
and 
\aA | 





Then, since 


aA G’'(w) aA 
(62) ae = 1+ 4d een au’ 


it follows that 
aA 
In view of (56) it follows that 
aA 2 0 ifu < 0, 


(64) — ; 
au<0 ifu>O, 


and hence 


(65) | “ | <1 for u on the open interval (0, @). 
Since 
0A aA 
a ini = = 
(66) ou lL = 1, ou LL mis 1, 
it follows that ‘ 
(67) |a| S 2. 
Accordingly, 
(68) =< <-1 forte t, 2sSus¢ (t)—VW(w); 
z 0A - =A 
(69) = >1 fori: St, us —-2,us¢ (&) —V(w). 


(B) then follows. 
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Setting 

(70) p(t) = ¢ ‘(&) — ¥(w»), 

we see that (A) and (B) imply not only that for any positive \ 


(ple) 
(71) max ffPe- re -wontt niet, < My <a 
for all sufficiently large £, but also that for any positive 
(p(é)) 
(72) lim max |e 


E+ ye y—-ASusv 


ju? atte = 0, 


for the ends of each function run down, as we have seen, faster than e!!"! and 
any finite portion of 


(73) exp {w§ — F(€) — wG(w) + F(G(w))} 


—ju2 


approaches e 
Let us consider 


(74) > a ent" ® = > a em) —F(G(m)) eng—F ()—nG(n) +P (G(m)) 

wg wo 
Since this approaches s as § > ~, it follows that the sum is S s + 1 fort = & 
for some &. But exp {né — F(E) — nG(n) + F(G(n))} is greater than some 
constant h, > 0 for 0 S u SX, = = &, forsome & = &, where — = g(u + ¥(n)). 
Hence 


asi— a, 


— nG(n)—F(G s 1 
(75) Z. ane G(n)—F(G(n)) < s+ | g > §. 


Osusd hy : 


Now 0 S u S&S Xis equivalent to 


(76) ¢ '() -A\ < v(m) Ss o"() 

and hence 

77 > a,e°8 rem) < My (v ™ 0, 1, 2, hae ). 
¥S¥(n) Str 


From these facts we prove immediately that 


(78) > Pe tated T ee (E— «); 


n=Wwo 


indeed 


a 
2 Gp er tin F(G(a)) ie _ freer, | 
we ' 


(p(é)] 
wr —ty2 — - 
(79) < > if max |e tu? ant FP) poe | 
¥ 


y= y—-lsusv 


anenvremt +0 (¢-4 0), 


vy—-lsusr 
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and this proves (78). If we define 


(2r)* 7 a, "8 —F (em) —e t 
(80) L{y) = ¥(wo) S¥(n) <u 
“ for Hy] s ¥(wo), 


then (78) is equivalent to 
(81) [ie ~ite~s?* dL(y) => 7x Meal du (x — @ : 


Now let us apply the General Tauberian Theorem of Wiener (loc. cit., pp. 27, 28) 
to (81). The kernel e —* has the non-vanishing Fourier toinaiornn e**: the 
function L(y) is a monotone function such that 


v+h 
(82) [ dL(y) ee (2r)' ys a, e729 F(a) < (2n)' Mk 
y vS¥(n) svt 
and hence if 
. ( 1 for0 S —u Si, 
(83) Kx(u) = | 
. 0 otherwise, 
then 
(84) [ K(x — y)dL(y) > aI K.2(u)du = Xs. 
That is 
zt+h 
(85) | dL(y) > Xs (rx—> «), 
or 
(86) (2x) a er m—F (a(n) — (x — ). 


AX oz<¥(n)sz4d 
This concludes the proof of Theorem 1. 


3. For the proof of Theorem 2 we note that the class > of functions of the 
form (83) is such that there is no x for which for every function K2(u) of = 


(87) (Qn i K,(u)e™ du = 0. 


Thus (84) implies‘ (81) and this proves Theorem 2. This argument shows the 
validity of Theorem 2 even if (8) holds only for two values of \ whose ratio is 
irrational. 


* See Wiener, loc. cit., p. 26. 
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4. For the proof of Theorem 3 let (4), (5) and (10) hold. Then for some 


positive ¢€ 





(88) G'(F'(x)) = Fr) > iF ay E> %, 
and hence for every positive ¢€ 

(89) x * < G(z); 

(90) r** < (a), 


for sufficiently large values of z. From (90) it follows that for every positive «, 
(91) a** < (2) 


for sufficiently large values of z. This proves part (i) of Theorem 3. 
For part (ii) let (4), (5) and (13) hold. In view of (5) and (13) it follows that 
G(x) is O(a‘) for every « > 0. Applying the Schwarz inequality to (38), we 


have 
z z= j 
va) s[ [ae [ owas] 
(92) = 0(2'[G(z)}) 
= O(2'**) siti 


which yields (14). 


5. Let (15) be an integral function. The condition (16) gives 


2r ~ 
(93) x i | f(re"*) ? 0 os > b,, |?" us se’ log r) (r ty ). 
If we set 
(94) a, = |b, |’, t = logr’, 


(93) becomes 
(95) > ane ?® 5 (—— w). 


Applying Theorem 1 to this case, we see that the conclusions of Theorem 4 
follow. Theorem 5 is an immediate consequence of Theorem 2. In order to 
obtain the result on gaps stated in Theorem 6, let us assume that (11), (12) and 
(14) hold for sufficiently large values of n and let us consider the large values of 
n for which 


(96) xrsv(n)S2r4+.., 


Let g(x) be the inverse function to ¥(z). Then the values of n for which (96) 
holds are those for which 


(97) 





g(x) Sn S g(x + dX). 
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But 
g(x + ») = g(x) + Ag’(x + OA) 0<¢6<1 
1 
(98) = 12) + Toe yr H)) 


= g(x) + rO([g(x + aa)]***) 
= g(x) + rO([g(x)]}***). 
Thus the values of n for which (96) holds are those for which 
(99) g(z) Sn S g(x) + O((g(z)]*") 
and hence if s # 0, the function (15) can have only a finite number of gaps of 
magnitude (», v + v***). 
6. Let a, = 0 (n = 1, 2, --- ), and let 
(100) Cr = pe Am On—m (n = 1, 2, win ec ). 
1 


If (8) holds for every positive value of \, in view of Theorem 2 it follows that 
(2) holds and hence 


2 2 
(101) [= anc] etr® _, (— — o); 
I 


(102) > ce?" ® _, :? (§  @), 


1 
In view of Theorem 1, (102) implies that 
(2m)! 


(103) lim Oe aed ale a s, 
2-900 A zS¥(n)S2tr 
where 
F(z) = 2F(z); G(x) = G(4x); 
, , ‘ 1 , 
(104) Gi(z) = 36'Q2); vile) = Vda); 


vil(xz) = ~/2 ¥(42x) + const. 


Thus (103) is equivalent to (20). Similarly, if (20) holds for every positive 
value of A, (102) holds and hence (2) holds. This shows the complete equivalence 
of the two statements contained in Theorem 7. 


MASSACHUSETTS INSTITUTE OF TECHNOLOGY. 








SUMMABILITY OF CONJUGATE DERIVED SERIES 
By W. C. RaNDELS 

We consider a function f(x) with period 27 and integrable over (—7z, 7). 

The Fourier series of such a function is 
dag + . (a, cos nx + b, sin nz), 
where 
a, = a f(x) cos nz dz, b, = if f(x) sin nz dz. 

The conjugate derived series of f(x) is 


2 
> n(a, cos nz + b, sin nz). 


n=1 


We define 
o® = f(x +t) + f(x — t) — f(z) 


and 


&(t) = a i ; o(y) ese” dy dy. 


Throughout this paper we shall suppose that g(t)/t C L on (—z, 7). This 
implies that &(t) C L, since 


[ jeoias 2 | aw [ oly) ese ty| dy = 2 ay | o(y) exe? by | [at 


' Ifa@> land 





_ ‘ dy 
= o( [ | e(y) | W) 


mt = " 
==! [ (y(t —y)*"*dy—S as t— +40, 


we say that 
conj. der. lim g(t) = S(R’, a). 
If 0 < @ S 1, and for some 8 > 1 
—_ t 
Fok [ H(y)(t — y)*dy>S as t— +0, 
0 


Received June 5, 1935; in revised form, February 7, 1937. The author is indebted to the 
referee for suggestions, which materially simplified the proof. 
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we say that 
conj. der. lim g(t) = S(R’, a) 
if 
if 
- i f ely)y (tt —y)*" dy +0 as t— +0. 
The series 


2 
da, 
n=0 


is said to be (C, a) summable to the sum S if 


l m 
a Aya, — S as m-—®%, 


4am n=0 


where 





ge - Tim+a+D 
“* = Ta+lrat+i) 


The theorems which we propose to prove are: 
TuroreM I.’ Jf 


(2) conj. der. lim g(t) = S(R’, a) (a > 0), 


then the conjugate derived series of f(x) is (C, 8) summable to the sum S for 8 > 
a+l. 

TueoreM II. If the conjugate derived series of f(x) is (C, a), a 2 1 to the sum 
S, then for B > a 


conj. der. lim g(t) = S(R’, 8). 


These theorems are analogous to those proved for Fourier series by Paley* 
and Bosanquet,’ for the conjugate series by Paley‘ and for the derived series by 
Takahashi.® These proofs used Riesz summability instead of Cesaro, but the 
two methods are known to be equivalent. 

We may assume without loss of generality that the point under consideration 
is the point z = 0 and that the function is even. We also shall assume that 


1 Theorem I for a = 0 has been proved by Gupta in a paper of a similar title, Proc. Acad. 
Sci. Allahabad, vol. 1 (1936), pp. 7-17. A generalization of this has been given by Mour- 
sund, Amer. Jour. Math., vol. 57 (1935), pp. 854-860. It might be mentioned that Bosan- 
quet has considered the problem of the boundedness of the (C, a) transforms of the sequence 
n(a, cos nz + b, sin nx) by similar methods, Trans. Amer. Math. Soc., vol. 39 (1936), pp. 
189-205. 

?R.E. A. C. Paley, Proc. Cam. Phil. Soc., vol. 26 (1930), pp. 173-203. 

3L. S. Bosanquet, Proc. Lon. Math. Soc., vol. 31 (1930), pp. 144-164. 

* Lec. 

5 T. Takahashi, Tohéku Math. Jour., vol. 38 (1933), pp. 265-278. 
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a@ = Oand f(0) = 0. This is permissible, for otherwise we would deal with the 
function 


So(t) = f() — a — [f(0) + a] cos t. 
Then at z = 0, g(t) = 2f(t). We have 
(t) + x | o(y) cot? ty dy = -i[ ¢e(y)dy— —ia=0 as t> +40. 
do Je 4m Jt 


Hence, if a > 1, 


t 
= [ &(y)(t — y)*"*dy—>S as t—>+0 
0 


implies that 


t r 
= [ if g(x) cot? 3a iz} (t—y)**dy>S as t—> +40. 
0 v 
Moreover it is clear that for a > 0, 
t 

af oY) (4 _ y)* dy —» 0 as t— +0 

* Jo Yy 
implies that 


t 
af ely) cot dy (t— y)**dy>0 as t—> +40. 
0 


We set g(t) = y(t) cot $t and, since g(t)/t CL, g(t) CL. If yt) =g(a@+8) - 
g(x — t), then in the notation of Paley (2) implies that 


(3) conj. lim y(t) = 4.8(R’, a). 


We can easily see that conversely (2) implies (1). The Fourier coefficients of 
g(t) are a, = 0 and 


b, = 2 g(t) cot $¢ sin ni dt 
T Jo 


® : 1 ® 
= | ¢(t) sin (n + $8 1 - ff g(t) cos nt dt. 
tT Jo sin 3¢ wT Jo 


We are now in a position to prove our theorems. First by a theorem of Paley’° 
(3) implies that for y > a > 0, 


= a Aj_nb,—~4S as m2, 
But 
wo 2 TT 
= g(t) cos nt dt = ¢(0) = 0, 
n=1 7 JO 


6 L.c., Theorem II. 
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so that (3) implies that, fory > a > 0, 


| m 1 m 
— > Az_.S, = a > Ata, 7S as no. 
m n=0 “im n=0 


By a theorem of Andersen’ this in turn implies that 


1 m 
ge 2 Annan — S as no (@=y+1>a+1). 


This completes the proof of Theorem I. 
Now let us suppose that 


1l¢ ‘ 
— > AZ_.na, > S=0 as n> (a 
m n=0 


IV 


1). 
By the theorem of Andersen® used before, this implies that the series 


Dd Sn 
n=l 


is (C, a — 1) summable to 0. By a theorem of Paley’ this implies that if a = 1, 


conj. lim y(t) = 0(R’, B) (8 > a), 
and as we have seen, this implies that 
conj. der. lim g(t) = 0(R’, 8) (8 > a), 


and completes the proof of Theorem II. 

It is necessary to use the condition a = 1 in Theorem II in order to apply the 
theorem of Paley. As a matter of fact, Paley’s results hold for one lower index. 
Compare Bosanquet’s result for the analogous problem in Fourier series.” I 
have a direct proof of Theorem II for a > 0. 

It might be mentioned that Theorem I is true for a = 0 if 


conj. der. lim g(t) = S(R’, 0) 
is defined by replacing condition (1) by 


o(t) 


; —0 as t— +40. 


NORTHWESTERN UNIVERSITY. 


7 A. F. Andersen, Proc. Lon. Math. Soc., vol. 27 (1927), pp. 39-71; see Theorem 3. 
8 L.c., Theorem 3. 

*L.c., Theorem IV. 

1° L.c., Theorem 4. 
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ORTHOGONAL POLYNOMIALS ON A PLANE CURVE 
By DunHAamM JACKSON 


1. Introduction. Polynomials in two real variables x and y orthogonal with 
respect to integration along a curve in the (z, y)-plane can be constructed by 
the usual process for building up a systém of orthogonal functions. If the 
curve is not algebraic, they have formal properties closely corresponding to 
those of polynomials orthogonal over a two-dimensional region.’ For an alge- 
braic curve the relations are different in important respects, reverting in some 
degree toward those which are familiar in the case of orthogonal functions of a 
single variable. This is to be pointed out in detail below, under hypotheses 
which, though not of the utmost generality, are still sufficiently illustrative. 


2. Orthogonal polynomials on a non-algebraic curve. Let g(t), y(t) be con- 
tinuous functions of t, of period A. If they are not both constant, the equations 
xz = g(t), y = V(t) may be regarded as defining a closed curve C (not necessarily 
of simple character). A relation of linear dependence connecting any finite 
number of the functions [y(¢)]’, [y(O)‘, h = 0, 1, 2, --- , k = 0,1, 2, --- , would 
mean that a polynomial in z and y vanishes identically on the curve, and so 
that the curve is the locus or a part of the locus of an algebraic equation. Let it 
be assumed for the present that no such relation of linear dependence exists. 

Let p(t) be a non-negative integrable function of period A, which, if not every- 
where positive, is at any rate such that its product with any polynomial in ¢(¢) 
and y(t) (having a non-vanishing coefficient) is different from zero for a set of 
values of ¢ of positive measure in a period. This condition will be satisfied, 
for example, if there is an interval in which p(t) is almost everywhere different 
from zero and for which the corresponding points (z, y) do not belong to an 
algebraic locus. 

Under the hypotheses that have been formulated any finite number of the 
quantities p’, pz, o'y, px’, pry, py’, --+ , regarded as functions of ¢ for 0 < 
t < A, are linearly independent. It is possible by ‘“‘Schmidt’s process of or- 
thogonalization” to construct from them a sequence of functions which are 
orthogonal and normalized over the interval. When the functions are 
taken in the order indicated, the members of the orthogonal set are of the form 
[0(t) gum 2x, y); = 0, 1, 2, a 0, 1, —? where x = e(t), y= v(t), 
and dnm is a polynomial of degree n in x and y together, while m is the exponent 


Received November 16, 1936; presented to the American Mathematical Society, Septem- 
ber 3, 1936. 

1 Cf. D. Jackson, Formal properties of orthogonal polynomials in two variables, this Jour- 
nal, vol. 2 (1936), pp. 423-434; referred to hereafter as paper A. 
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of the highest power of y occurring in a term of the n-th degree.” The q’s will 
be said to constitute a set of normalized orthogonal polynomials on the curve C, 
with respect to ¢ as parameter and p(t) as weight function, satisfying the con- 
ditions that 


[ eae, Y)drnlx,y)dt=0, |n—k|+|m—1| #0, 


[ p(t) {qan(x, yF dt = 1. 


Certain properties of these polynomials can be stated briefly, the proofs being 
so similar to those for the case of polynomials orthogonal over a region’ that 
it is unnecessary to give them in detail; it is to be borne in mind that under the 
present hypotheses, if the product of p by a polynomial in zx and y vanishes 
almost everywhere as a function of ¢ on the interval (0, A), the coefficients of 
the polynomial must all be zero. Any polynomial of the n-th degree which is 
orthogonal (in the sense under consideration) to every polynomial of lower 
degree must be a linear combination of gro, --- , an. Any n + 1 polynomials 
Pro, *** 5 Pon Which are expressible in terms of gno, --- , nn by an orthogonal 
linear transformation are orthogonal to each other, normalized, and orthogonal 
to every polynomial of lower degree. And any n + 1 polynomials of the n-th 
degree which are normalized, orthogonal to each other, and orthogonal to every 
polynomial of lower degree are related to any other such set, and in particular 
tO dno, *** 5 Qnn, by an orthogonal transformation. 

Suppose there is a number yz between 0 and A such that 


o(t + nw) = ag(t) + BY, 
(1) v(t + wu) = velt) + (0), 
p(t + uw) = p(t) 


for all values of t, where a, 8, y, 6 are constants. The determinant ad — fy is 
certainly different from zero; otherwise there would be a linear relation con- 
necting g(t + yz) and ¥(¢t + x), in contradiction with the hypothesis that the 
curve is not algebraic. If (xz, y) is a point of the curve, the point (2’, y’) with 
coérdinates given by 


(2) x’ = ax + By, y’ = yx + by 


is also a point of the curve, and vice versa; the transformation (2) carries the 
curve into itself. 


2 In a corresponding passage in the paper A, p. 424, ‘‘the degree with respect to y’’ should 
be amended to read ‘‘the degree of the leading homogeneous aggregate of terms with re- 
spect to y’’. 

3 See the paper A, pp. 424-425. 
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If P(x, y) is any polynomial in z and y, 


[ p(t)Plag(t) + BY(t), ye(t) + dy(t)ldt 
= | b p(t + uw) Plo(t + uw), Wt + w)ld(t + ») 
= [* otoPtoo, wana 


the last equality resulting from the periodicity of the functions involved. In 
summary, if P(x’, y’) as a polynomial in z and y is denoted by II(z, y), 


(3) [ conte, yar = [ wre, y’)dt = | wre, y) dt. 


For any polynomial p(z, y) which is normalized on C, let p(2’, y’) = x(x, y). 
Then the last relation, with P(x, y) = [p(z, y)]’, says that x(z, y) also is nor- 
malized. If p(x, y) and s(z, y) are two polynomials which are orthogonal to 
each other, and if p(x’, y’) = r(x, y), s(2’, y’) = o(z, y), the relation (3) with 
P = ps means that r(z, y) and o(z, y) are orthogonal. If p(z, y) is of the n-th 
degree, and the condition of orthogonality is satisfied for every polynomial 
s(x, y) of lower degree, then o(z, y) also is an arbitrary polynomial of degree 
lower than the n-th, and r(z, y) is orthogonal to every such polynomial. Applied 
in particular to any set of n + 1 polynomials pro, --- , Pan Of the n-th degree 
which are normalized, orthogonal to each other, and orthogonal to every poly- 
nomial of lower degree, this reasoning shows that pro(z’, y’), --- , Pan(2’, y’) 
are expressible in terms of pro(z, y), --+ , Pnn(z, y) by an orthogonal linear 
transformation. 

The work of the last three paragraphs applies also, with obvious modifications 
in detail, if the equations (1) are replaced by 


e(u — t) = ag(t) + BY(d), 
(4) W(u — t) = volt) + Spd), 
p(u — t) = p(t). 


Furthermore, the entire discussion remains valid if the functions ¢, ¥, p are 
defined merely for 0 < ¢ S A, the hypothesis of periodicity being dropped, 
and the closed curve C replaced by a non-algebraic arc, except that in place of 
(1) and (4) only the equations 

e(A — t) = ag(t) + BY(t), 

W(A — t) = yell) + SY), 
p(A — t) = p(t) 
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come into consideration. Other extensions readily suggest themselves, though 
it is not so clear what the most general possible formulation would be. 

Under conditions of some generality the transformation (2) must itself be 
orthogonal. Let the curve C be rectifiable, and let it be represented by the 
equations 


x = ¢(s), y = ¥(s) 


in terms of the arc-length as parameter. For simplicity, let it be made up of a 
finite number of ares, on each of which ¢(s) and ¥(s) have continuous deriva- 
tives. Then 


(5) [e'(s)F + W(s)F = 1 


except for a finite number of points at most. Let ¢ and y satisfy identically 
the equations 


o(u + 8) = ag(s) + ByY(s), 
¥(u + 8) = ve(s) + dy(s), 
or the alternative equations with un + s replaced by » — s. Then 
= [e'(u + 8) + Wu + 8) 
= (a + vle'(s)F + 2(aB + vd)e'(sW'(s) + & + FYI (SF, 
which with (5) means that 
(a° + 7° — I)le’(s)F + 2(a8 + vé)e'(s\W'(s) + (8 + — 1lW(s)F = 0, 


except at a finite number of points. Let the slope dy/dxr = y’(s)/g’(s) be de- 
noted by A. If \ takes on three or more distinct finite values, the quadratic 
equation 


— 


(a® + 7° — 1) + 2%(aB + yd)dA + (8° + F — 1)’ =0 
has more than two distinct roots, and its coefficients must vanish; the equations 
at+y=1, e+e =1, op + 75 = 0 


are the conditions of orthogonality. The same conclusion holds if ¢’(s)/y’(s) 
takes on three distinct finite values, and so if the slope takes on the value 
and two finite values different from zero; and again if there are regular points 
of the curve for which respectively ¢’(s) = 0, ¥’(s) = 0, and ¢’(s)¥(s) ¥ 0, ice., 
if the slope takes on the values 0 and & and a finite value different from 0. 

These brief observations raise numerous questions which are left unanswered. 
It may be pointed out, however, that the hypothesis that the curve has at least 
three different directions is not entirely irrelevant. For the parallelogram with 
vertices at the points (+2, 0) and (0, +1) (an algebraic curve, to be sure) is 
carried over into itself by the non-orthogonal transformation 2’ = —2y, y’ = 32, 
arising from the substitution of s + 5° for s. 

The pertinent parts of the discussion in the paper A relating to particular 
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transformations (2) and the orthogonal transformations of pro, +--+ , Pan in- 
duced by them, specifically in the case of the transformations x’ = —2,y' = —y 


(see footnote 6 of paper A) and 2’ = y, y’ = 2, can be carried over to the present 
situation. Also the reasoning of the second section of that paper, which is 
concerned with a recursion formula and a Christoffel-Darboux identity for the 
orthogonal polynomials, is applicable here with the obvious formal adaptations. 


3. Definition and orthogonal transformation of orthogonal polynomials on an 
algebraic curve. The foregoing conclusions have to be modified if the curve 
to which the discussion relates is algebraic. Let the weight function p(¢) for 
simplicity be taken as positive everywhere or almost everywhere on its range 
of definition, so that questions of linear dependence shall not be complicated 
by the possibility of vanishing of p. Let g(t) and Y(t) once more be continuous 
for 0 S t S A, or continuous everywhere and of period A, and not both con- 
stant, but let it be supposed now that there is a polynomial in x and y which 
vanishes identically for 0 < ¢ S A when z and y are replaced by ¢(¢) and y(é). 
Any multiple of such a polynomial will naturally have the same property. Ifa 
factor of a polynomial of this sort vanishes at only a finite number of points 
of the curve C, the quotient obtained by dividing out this factor will vanish at 
points arbitrarily near the finite number of points in question, and will vanish 
at those points by continuity and so at all points of the curve. There is there- 
fore a polynomial vanishing identically on C, and composed of irreducible factors, 
each of which vanishes at infinitely many points of C. 

If Q,(2, y) is a particular polynomial meeting these specifications, any irreduci- 
ble polynomial vanishing at infinitely many points of C must vanish at infinitely 
many points simultaneously with one of the irreducible factors of Q,(7, y), and 
must be identical with that factor except for a constant multiplier. There 
can be only a finite number of essentially distinct irreducible polynomials 
vanishing at infinitely many points of C; their product, determined except for a 
constant factor, may be characterized as the polynomial of lowest degree vanish- 
ing identically on C. Let this polynomial be denoted by Q(z, y), and let its 
degree in the two variables together be N; the choice of the constant multiplier 
is immaterial. (The irreducible factors are essentially real; if an irreducible 
polynomial with complex coefficients vanishes at infinitely many real points, 
its conjugate, vanishing at the same real points, must be a constant multiple 
of it.) Any polynomial which vanishes identically on C is a multiple of Q(2, y). 
The curve C is by no means necessarily the complete locus of the equation 
Q(x, y) = 0; it may be a triangle or a square, or otherwise composed of a number 
of algebraic arcs. 

Let the rank of a monomial zy for the moment denote the index of its posi- 
tion in the sequence 1, x, y, x, zy, y’, --- , so that if two terms are of different 
degrees in the two variables together, the one of higher degree has the higher 


* See for example, M. Bécher, Introduction to Higher Algebra, pp. 210-211. 
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rank, while if two terms are of the same degree the one which is of higher degree 
in y has the higher rank. If the terms of two polynomials are arranged accord- 
ing to rank in descending order, the leading term of the product of the two 
polynomials is the product of their leading terms. If the leading term of 
Q(x, y) is 2’y*, p + q = N, the leading term of any polynomial which vanishes 
identically on C must be divisible by 2’y*. And if h and k are any two exponents 
such that h = p, k = q, multiplication of Q(z, y) by 2” ’y* gives a polynomial 
vanishing identically on C, and having x"y‘ for its leading term. In other 
words, among the monomials x", x"""y, --- , y" of the n-th degree, when n = N, 
each member of the set 2" “*y’, 2" *"y*", ...,2”y"” is linearly expressible 
on the curve C in terms of monomials of lower rank than its own, while none of 
the remaining N monomials of the n-th degree, in which the first exponent is 
less than p or the second exponent less than g, is connected with terms of lower 
rank than its own by any relation of linear dependence on C. If all terms zy‘ 
in which simultaneously h = p and k = q are omitted from the sequence 1, 
x,y, x, ry, y’, «+», Schmidt’s process can be applied to the remaining terms, 
taken in the order indicated, and yields a set of normalized orthogonal poly- 
nomials comprising just N polynomials of the n-th degree for each value of 
n 2 N (and n + 1 polynomials of degree n for n < N). 

The preceding argument as it stands is of course based essentially on a par- 
ticular arbitrary arrangement of the monomials z’y* in serial order. It remains 
to be seen to what extent it possesses more general significance. 

In the orthogonal system that has been described let the polynomials of the 
n-th degree be denoted by gai, G2. +--+ , Gav, for n 2 N, where the first sub- 
script indicates the degree in the two variables together, and for fixed n the 
second subscript increases with the exponent of the highest power of y occurring 
in a term of the n-th degree, but is not in general equal to that exponent. The 
modifications required for values of n < N will be obvious. Let the leading 
term of a polynomial be characterized still in terms of the notion of rank already 
employed. Each of the monomials x", x”‘y, --- , y", with an appropriate co- 
efficient, is leading term either of one of the polynomials q,, or of a polynomial 
which vanishes identically on C. If p(z, y) is an arbitrary polynomial of the 
n-th degree orthogonal to every polynomial of lower degree, suitable constant 
multiples of the n + 1 polynomials just mentioned can be subtracted from 
p(x, y) to remove successively all the terms of the n-th degree in order of de- 
creasing rank, leaving a polynomial which, being orthogonal to every polynomial 
of degree lower than the n-th, must be orthogonal to itself and so identically 
zero on the curve. This means that p(x, y) is equal on the curve to a linear 
combination of Ga, +++, Qa , Or as a polynomial in x and y is the sum of such 
a linear combination and a polynomial which contains Q(x, y) as a factor. 

It follows that any set of N polynomials px, --- , Pay of the n-th degree 
which are normalized, orthogonal to each other, and orthogonal to every poly- 
nomial of lower degree, can be expressed on C in terms of qui, +++ , Gav by an 
orthogonal transformation. Clearly any set of N polynomials thus expressible 
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in terms of the q’s will have the properties mentioned. And since for points 
on the curve the q’s are given in terms of the p’s by the inverse orthogonal 
transformation, any polynomial of the n-th degree orthogonal to every poly- 
nomial of lower degree can be linearly represented on the curve in terms of the 
p’s, and any two sets of p’s are related to each other on the curve by an orthog- 
onal transformation. It is perhaps not superfluous to emphasize that all these 
relations, which for the polynomials in x and y have the character of congruences 
with respect to Q(z, y) as modulus, are identities in the functions of ¢ which 
are obtained on replacement of x and y by ¢(t) and Y(t). In terms of x and y 
the individual polynomials of the n-th degree in the orthogonal system are sub- 
ject not merely to the indeterminacy involved in the admissibility of orthogonal 
transformation, but also to the addition of arbitrary polynomials of the n-th 
degree containing Q(x, y) as a factor. 

If the curve C is a straight line segment, N = 1, and the orthogonal system 
contains just one polynomial of each degree. With s as parameter and weight 
function unity the orthogonal polynomials are essentially the Legendre poly- 
nomials in s, and introduction of a weight function p(s) gives rise to the cor- 
responding set of orthogonal polynomials in a single variable. 

If C is the unit circle, with the parametric representation 


xr = cost, y = sint, 0st S 2z, 


and p(t) = 1/7, a polynomial in z and y is a trigonometric sum in ¢, the orthog- 
onal polynomials of the n-th degree (NV = 2), chosen with the exercise of a 
particular option as regards orthogonal transformation, reduce on the curve to 
cos nt and sin nt for n = 1, and in terms of x and y they are any polynomial 
representations of cos nt and sin nt in terms of cos ¢ and sin ¢, e.g., a polynomial 
of the n-th degree in x and the product of y by a polynomial of degree n — 1 in z, 
or, alternatively, the real and pure imaginary parts of (x + iy)". A general 
orthogonal transformation on cos nt and sin nt has the effect merely of replacing 
them by cos (nt + k) and + sin (nt + k). Admission of a non-constant weight 
function p(t) leads to more general orthogonal trigonometric sums of the type 
discussed in a recent paper by the writer.” 

Substitution of the ellipse x = a cost, y = b sin ¢ for the cirele, with a given 
p(t), does not change the orthogonal functions as to their dependence on 1, 
but changes the coefficients of their representation in terms of x and y, by the 
substitution of z/a and y/b for x and y. Solution of the problem for the ellipse 
with s as parameter and constant weight function would correspond to the 
construction of orthogonal trigonometric sums in ¢ (not trigonometric sums in s) 
with p(t) = ds/dt. 

The rather obvious remarks of the last three paragraphs are inserted merely 
for the sake of showing more clearly how the theory under discussion may be 


5D. Jackson, Orthogonal trigonometric sums, Annals of Mathematics, vol. 34 (1933), 
pp. 799-814. 
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regarded as a generalization of that of the most familiar orthogonal systems in 
one dimension. 

If ¢, ¥, p satisfy a set of relations (1) or (4), and if x’, y’ are defined by (2), 
the reasoning carried through for a non-algebraic curve can be used now to show 
that pr(a’, y’), --- , Paw(2’, y’) are expressible on the curve in terms of pyi(z, y), 
-++ , Pan(x, y) by an orthogonal transformation, provided that C is not a straight 
line segment. In the excluded case the inference as to the non-vanishing of 
aé — y is inadmissible; there is perhaps no need of dwelling further on this case 
in the two-dimensional setting, the facts with regard to orthogonal polynomials 
in a single variable being well known. In particular, if (2) is the transformation 
r= —2,y' = —Yy, Pri(2’, y’) is identically equal on the curve to (—1)"p,.(z, y), 
fori = 1,2,---,N. But the transformation 2’ = y, y’ = x requires further 
consideration, which will not be undertaken here; for in the two-dimensional 
case,® or in the case of a non-algebraic curve, it is possible to say that the poly- 
nomial q,0(z, y), for example, is neither symmetric nor skew-symmetric, while 


on the circle 2° + y° = 1, with constant weight function, the corresponding 
polynomial for n = 2 satisfies the congruence-identity 
227 — 1 = —(2y* — 1). 


4. Recursion formula and Christoffel-Darboux identity on an algebraic curve. 
By reason of the fact that the number of polynomials of the n-th degree in the 
orthogonal system does not increase indefinitely with n, the recursion formula 
and the Christoffel-Darboux identity have a somewhat closer resemblance to 
those found in the case of a single variable than when the domain of orthogo- 
nality is a non-algebraic curve or a two-dimensional region. It is to be kept in 
mind, however, that the various identities hold in general only on the curve. 

Let pni(z, y) be an arbitrary one of the polynomials of the n-th degree. The 
product zrpni(z, y), as a polynomial of degree n + 1, is expressible on the curve 
in the form 


(6) xpaz,y) = DojAniiPasrle, y) + Di Bris Pailz, y) + Doi CniiPr—aslz, y), 
with 


Auj = [ orate, Y)Pn+i,i(x, y) dt, 
Bui = [ p(t)apailr, y)pni(a, ydt = Bj, 


Casi = [ ozone, Y)Pn—1, (2, y) dt = A n-1, ji . 


Terms of lower degree are absent from the right-hand member because the 
corresponding coefficients vanish, by reason of the property of orthogonality. 


® See paper A, p. 427. 
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In this section, in the absence of express indication to the contrary, the sign = 
always denotes summation over the designated index from 1 to N. The 
expression (6) is then appropriate in the first instance for values of n 2 N; it 
can be made formally valid for 0 < n < N — 1, however, by adoption of the 
convention, in the integral formulas for the coefficients as well as in the identity 
itself, that p,(z, y) = Owhen0 Sn+1<j SN. With this convention, 
which will be maintained henceforth, the identity holds trivially when Ll S n + 1 
<i < N in the left-hand member, as well as significantly when p,;(z, y) is not 
identically zero. The recursion formula (6) resembles that for a single variable 
in that the number of its terms does not increase with n. 

Multiplication of (6) by p,i(u, v), followed by summation over the index ¢ 
from 1 to N, with replacement of the coefficient C,;; by its equal Aj_1,;;, gives 


x Di Pni(2, Y)Pni(u, v) = 2a dA nij Pn+i,i(2, Y)Pni(u, v) 
+ Li Di Basi Pailz, y)Pniluy v) + Dos DA v1.48 Puri, y) nil u, 0). 


On interchange of (x, y) with (u, v) this becomes 
U DoiPni(x, y)Pnilu, v) = Doi Dos Anis Pngs,i(U, v)Pai(, y) 
+ Di Doi Bnij Pnj(U, Vv) Pni(z, y) + ) >A n—1,ji Pn—1,j(U, V)Pni(2, y). 


In the second summation on the right-hand side of (8) let the index symbols 
i and j be interchanged, the coefficient B,;; being restored, however, by virtue 
of the fact that B,;; = B,j; ; the sum in question then becomes identical with 
the corresponding sum in (7). Subtraction of (7) from (8), with interchange 
of ¢ and j in the last summation of each identity, gives 


(u _ x) DiiPas(z, y)Pni(u, v) 
(9) = dos DA nis lPngs,j(U, v)Pni(Z, y) — Pasrs(2, y)Pni(u, v)] 
— Didi Anrislpalu, v)pa—i(z, y) — Pailt, y)Pn—rs(u, v)]. 


For the evaluation of 


n N 
K,(2, y, u,v) = >> Pei(X, Y)Pei(u, v) 
k=0 11 
let (9) be written with k in place of n fer each value of k from Oton. The result 
of summation with respect to k, when due account is taken of the fact that 
p-1.; = 0 for all values of 7, is 


(u — x)K,(2,y, u,v) = >i DA nijLPn4t,j(Uy V)Pni(X, Y) — Pnsr,i(Z, Y)Pni(u, v)]. 


In this relation, which has the character of a Christoffel-Darboux identity, 
the number of terms again remains constant as n increases. There is naturally 
a corresponding formula for (v — y)K,(z, y, u, v), differing from this only in the 
values of the coefficients Ai; . 
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THE CLASSES OF INTEGRAL SETS IN A QUATERNION ALGEBRA 
By CLAIBORNE G. LATIMER 


1. Introduction. Let % be a rational generalized quaternion algebra with 
the fundamental number d.’ A set of integral elements in &, or more briefly 
an integral set, is one with certain properties R, C, U, M as defined by Dick- 
son.” Two integral sets are said to be equivalent, or of the same type, if there 
is a one-to-one correspondence between the elements of the sets which is preserved 
under addition and multiplication. All the sets equivalent to a given set will 
be said to form a class. Two integral sets, @ and @,, belong to the same class 
if and only if there is a non-singular element a in & such that @, = aGa’.* 

By a result due to Artin,* the number H of classes of integral sets in % is equal 
to the number of classes of equivalent right ideals in an arbitrarily chosen inte- 
gral set G, of A, Artin’s definition of equivalent ideals being broader than the 
usual definition. 

The principal purpose of this paper is to show that there is a one-to-one cor- 
respondence between the classes of integral sets in % and certain classes of 
ternary quadratic forms. These classes of forms are the non-negative classes 
or the improperly primitive non-negative classes in a certain genus G, according 
as d is even or odd. G is uniquely determined by d. If d < 0, by a known 
theorem there is a single class of forms in G and therefore H = 1.° 

We shall also determine a relatively simple basis of an arbitrarily chosen 
integral set in Y%. 


2. A normal basis of an integral set. Ifo, --- , \3 forma basis of an integral 
set G, then AA; = Vicipndrr (i, 7 = 0,---,3). A set G, is equivalent to G if 
and only if it has a basis &, --- , & such that ££; = Dyesjyé, (i,7 = 0, --- , 3). 
The following theorem is a consequence of certain results due to Brandt.* 

THEOREM 1. Let % be a generalized quaternion algebra with the fundamental 


Received November 16, 1936. 

! For the definition of d, see Brandt, [dealtheorie in Quaternionenalgebren, Mathematische 
Annalen, vol. 99 (1929), p. 9. 

2 Algebras and their Arithmetics, pp. 141, 2. It may be shown that our definition of 
an integral set is equivalent to Brandt’s definition of a mazximaler Integritdtsbereich, loc. 
cit., p. 11. 

’ Deuring, Algebren, p. 89. 

* Abhandlungen aus dem Mathematischen Seminar der Hamburgischen Universitit, 
vol. 5 (1927), p. 288, Theorem 20. 

5 In another paper, it was shown that if d < 0, then every one-sided ideal in an integral 
set is principal. (Transactions of the American Mathematical Society, vol. 40 (1936), 
p. 322.) From this and Artin’s result, cited above, it again follows that if d < 0, then 
H = 1. 

® Loc. cit., pp. 8-11. 
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number d, and let » = 1, --+ , 3 be a basis of A. The d’s form a basis of an 
integral set if and only if N(Zxi) = © (to, ++ > , Xs) = § Vgijrix;, where 
(a) the coefficients of © are integers, 9i; = 9ji, Go = 2; 
(b) the determinant | gi; | = a’; 
(c) every third order minor in the matrix (g;;) is divisible by d and every principal 
third order minor is divisible by 2d. 

Two integral sets in U are equivalent if and only if they have basal elements Xo, --- , 
hz and fo, «++ , 3 respectively, such that N(2xA;) = N(=zié;). 

Let G be an integral set in & with the basis Ay = 1, --- , As and let N(Zz,A;) = 
} Sgi;2;2; as in Theorem 1. Suppose every go; were even. Then d would be 
even and, since every gi; = 29;; is even, by (c) of Theorem 1 every g;; would be 
even. Then d would be divisible by 4, whereas d contains no square factor > 1. 
Therefore one of the go; is odd. We may then assume, after an integral trans- 
formation of determinant unity, that gu = 1, go = gos = 0. 

The trace, or double the scalar part, of X = z,d; is T(X) = 2x) + 1. 
Then T(x) = 1, T(Qe) = T(As) = 0. Such a basis of G will be called a normal 
basis. For a normal basis, the matrix (g;;) of 2 @ is in the form 


2 1 0 O 
(1) M= 1 gu gi Gis 

O ga 922 2s 

O gs: gs2 933 


Since the determinant | M | = d’, and every gi: is even, it follows that gx: = 
d (mod 2). 


3. The class of ternary forms corresponding to a class of integral sets. Let 
ho = 1, --- , As be a normal basis of the integral set G, and let § be the module 
consisting of all elements in @ of trace zero. Since T(227,A;) = 2% + 2, an 
element is in § if and only if it may be written in the form X = 2,(2A, — 1) + 
odo + 233, Where the z’s are integers. The norm of the general element 
in § is N(X) = f(a, 22, x3) where f is the ternary quadratic form with the matrix 


2gunr—1 Qe 9is3 
r = J21 go 3 gos |. 
gsi 3 gs2 93s 


f is a classic form, i.e., the coefficients of all cross-product terms are even, if 
and only if dis even. Since | M | = d’, it may be shown that | T° | = d°/4. 
Suppose &, --- , &; form a normal basis of G. Then 2& — 1, &, & form a 
basis of § and therefore these elements are obtained from 2d; — 1, Az, As by an 
integral transformation of determinant + 1. Hence the form N[y(2& — 1) + 


7 Brandt, loc. cit., p. 12. 
* Brandt, loc. cit., p. 10. 
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yot: + ysks] is equivalent to f. Hence the class ©, of forms equivalent to f 
is independent of the particular normal basis employed and is uniquely deter- 
mined by G. 

More generally, suppose @, is an integral set in the same class as G. By 
Theorem 1, @, has a basis m, --- , 93 such that N(Z2imi) = ®. Hence &, is 
uniquely determined by the class of integral sets containing G. 

Suppose d = 2d, iseven. It will be found that every second order minor of T 
is equal to one half or one fourth of one of the third order minors of M. There- 
fore by Theorem 1 every such second order minor is divisible by d,. Since 
|T | = di, it follows that + d, is the g.c.d. of these second order minors. By 
the definition of d, # is positive or indefinite according asd > Oord <0. Hence 
the same is true of f and the invariants of f are Q = d;, A = 1.’ Since d, is odd 
and contains no square factor > 1, f is a properly primitive form. 

If d is odd, it may be shown that the invariants of the improperly primitive 
form 2f are Q = d, A = 2. 

The class of forms containing f or 2f, according as d is even or odd, will be 
said to correspond to the class of integral sets containing ©. 

Let F be a quadratic field which is imaginary if d > 0. It is known that % 
contains a field equivalent to F if and only if no prime factor of d is the product 
of two distinct prime ideals in the set of all integral numbers of F."° If d is even, 
f is a properly primitive form and if d is odd, 2f is improperly primitive. There- 
fore, in either case, f represents a positive integer k, prime to 2d." Then § 
contains an element , such that > = —k. Since % contains the quadratic 
field F(n), it follows that —k is a quadratic non-residue of every odd prime fac- 
tor, 91, 2, °**,q.0fd. Hence if d is even, the characters of f are” 


(2) xi = (*) = (—j)e*o* (= 1,2 ---, 2). 


If d is odd, the invariants of 2f are Q = d, A = 2 and therefore by the last refer- 
ence, the characters of 2f are 


(3) xi = (2*) = (5) estOerter (i - 1, 2, wee, n). 


qi 


We have then all but the last sentence of 
THEOREM 2. Let % be a rational generalized quaternion algebra with the funda- 
mental number d. For every class of integral sets in A there is a uniquely deter- 


® For the definitions of these invariants, see Dickson’s Studies in the Theory of Numbers, 
p. 10. This book will be referred to hereafter as Studies. 

10 This is a consequence of a more general theorem by Hasse, Die Struktur der R. 
Brauerschen Algebrenklassengruppen tiber einem algebraischen Zahlkérper, Mathematische 
Annalen, vol. 107 (1933), p. 731; Deuring, Algebren, p. 118. See also The quadratic sub- 
fields of a generalized quaternion algebra, this Journal, vol. 2, p. 681. 

1 Dickson, Studies, Theorems 6, 7, p. 8. 

2 Dickson, Studies, p. 52. 
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mined corresponding class © of non-negative ternary quadratic forms. If d = 2d, 
is even, the invariants of the forms in © are Q = di, A = 1 and their characters 
are the x; in (2). If d is odd, the forms in € are improperly primitive and have 
the invariants Q = d, A = 2 and the characters x; of (3). No class of forms cor- 
responds to two classes of integral sets. 

To prove the last sentence of the theorem, we employ the following which 
will be proved in §6. 

Lemma 1. Let G, G, be sets of integral elements in A and let §, F: be the sets 
of elements in G, @,, respectively, of trace zero. If § = F1, thenG = G,. 

Let Xo, «++ , As and &, «++ , & be normal bases of the integral sets G and G, 
respectively. Suppose the form N[x,(2A: — 1) + ede + asd3] = f(x, re, Xs) 
is transformed into N[yi(2i — 1) + yets + ysts] = filys, ye, ys) by a trans- 
formation z; = 2,tiy; (¢ = 1, 2, 3), where the ?’s are integers, |#;;| = + 1. 
If wis rational and T(Z) = 0, then N(u + Z) = uw? + N(Z). Hence N(Zy:é:) = 
yo + your + yi/4 + filyr/2, ye, ys) = Pilyo, --*, Ys). Let m = 1 and let 
m, n2, ns be defined by 


2m —1 ty (2Ay - 1) + torr2 + tziA3, 
(4) m2 = te(2A, — 1) + teedr2 + teds, 
ns = ts(2A1 — 1) + tasdr2 + tosds. 


Then Y; = yi(2m == 1) ao Yone ao Y3n3 = x (2r, = 1) + Lede oa X3A3 and N(Y¥1) = 
filth, Y2, ys). Therefore if Y = Lyini, then N(Y) = fi(yo, m1, ye, ys) = 
N(yié:). Since the n’s form a basis of % and the ¢’s form a basis of @,, by 
Theorem 1, the ’s form a normal basis of an integral set ©’ equivalent to @. 
But by (4) and Lemma 1, @' = @. Hence G and G, belong to the same class. 
This completes the proof of Theorem 2. 

If d is even, the forms in € and also their reciprocals have odd determinants 
and therefore are properly primitive. Therefore by Theorem 2 and a known 
result,’* we have the 

Corotiary. If d < 0, any two integral sets in A are equivalent. 


4. The correspondence between classes of integral sets and classes of forms. 
We shall now prove 

THeoreM 3. Let % be a generalized quaternion algebra with the fundamental 
number d. If d is even, let G be the genus of forms with the invariants Q = d/2, 
A = 1 and the characters x; of (2). If d is odd, let G be the genus of forms with 
the invariants 2 = d, A = 2 and the characters x; of (3). There is a one-to-one 
correspondence between the classes of integral sets in A and the non-negative classes 
in G or the non-negative improperly primitive classes in G, according as d is even 
or odd. 

If d < 0, the theorem follows from Theorem 2 and the theorem cited in the 
proof of the corollary. Hence we shall assume that d > 0. By Theorem 2, it 


‘8 Dickson, Studies, Theorem 47, p. 54. 
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will be sufficient to show that every positive class or every improperly primitive 
positive class in G corresponds to a class of integral sets. 

Suppose d = 2d, is even. Let € be a class of positive forms in G. Let 
y = Dh;,x,x; be the reciprocal of aform in ©. We have seen that y is a properly 
primitive form and therefore it represents an integer prime to d."* We may then 
assume that h33 is prime tod. Let f = Ya,;x;x; be the form in € reciprocal to 
y, and let A;; be the cofactor of a;; in the matrix (a;;). Then aya2 — aj. = 
A33 = d,h, where h = hs is prime tod. Consider the binary form f(x;, 22, 0). 
Since d; contains no square factor > 1 and is prime to h, the g.c.d. of the co- 
efficients of this form is prime to d,. Therefore f(7,, x2, 0) represents an 
integer prime to d;. Hence we may also assume that ay is prime to d;. Let 

= ak’, ay, = at’, where a, a contain no square factor > 1. In f(z;, 22, 23), set 


a2 Ais 

yu=-t-— — Zz 

1 .** i” 
A 

2 = y+ 72, 
33 

z%3 = z, 


and in the resulting form, replace z, y, z by x/t, atz/k, aky respectively. We 
obtain 


V(x, y, 2) = ax” + By’ + af", 


wheré B = ad;. 

Since h is prime to d,, 8 contains no square factor > 1. Let a = a6, 8B = 8,6, 
where 6 is the g.c.d. of a, 8. Since ay is prime to d,, 8; is divisible by d,. Let 
B be the least positive divisor of 6; such that y° + a = 0 (mod 8,/B) has a 
solution. Since f represents ay, = at’ and the characters of f are the x; of (2), 
it follows that —a is a quadratic non-residue of every prime factor of d;. Hence 
d, divides B. 

Let %’ be an algebra with the basis 1, i, j, ij, where 7” = —a, jj? = —8, ij = 
—ji. The fundamental number d’ of &’ is divisible by B.” Hence d’ is divisible 
by d,. 

Since Y = Lh;;x,; is properly primitive, after an interchange of variables 
necessary, we may assume that hy is odd. Replace 7 by 2, + fe + nfs, 
where — = 0 or 1 according as hye is even or odd and » = 0 or 1 according as hy; 
is even or odd. In the resulting form, y = Lk raj, ku + 1 = ke = ky = 0 
(mod 2). Since | k;;| = d, is odd, it follows that ke; is odd. Let fy = Lei j;ria; 
be the form in € reciprocal toy. Then en = keeks; — k3s, CiurCe2 — Cre = dykes, 
CuCss3 — Ci, = dike, and therefore 


(5) Cu 


—1 (mod 4), co = Cy, C33 = Cis (mod 2). 


4 Dickson, Studies, p. 14, Theorem 16. 
% This Journal, vol. 1 (1935), p. 435. 
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Let 9i;, gi; be the integers defined by equating the matrix I’ of §3 to (c;;) 
and let goo = 2922, 93s = 2933. Consider the matrix M = (g,;) of §2 with the ele- 
ments g;;asthusdefined. Since | ¢;; | = dj, it may be shown that | M | = 4dj = d’. 
Let Go = | gi; |, (i,j = 1, 2,3). We have d’ = 2Go — (g22gss — gis). Since 
J229x3 — G23 = 4(C20¢33 — C33) = 4d,ky, and ky is odd, it follows that Goo is divisible 
by 2d. Noting that by (5), gx + 2gi2 = gss + 29is = 0 (mod 4), and employing 
the fact that every second order minor of (c;;) is divisible by d:, we may show 
that M = (g;;) satisfies the remaining conditions of (c) in Theorem 1. 

We have seen that f is transformed into W by a non-singular transformation 
with rational coefficients. Hence ¥(y:, ye, ys) is transformed into f,(a1, 22, 23) 
by such a transformation, y; = 2js;;7; (¢ = 1, 2,3). Let wi, w:, ws be elements 
of Yt’ defined by 





Sut + SaJ + 8310) , 


20, — 1 


we = Sil + SJ + 832i], 
w3 = 8130 + S239 + S330). 


Then w = 1, --- , ws form a basis of YA’. We have X; = 2:(20: — 1) + Tow. 
+ x33 = Yt + Yoj + ystj and therefore N(X,) = fi(ai, x2, 23). Since T(X,) = 
0, it follows that if X = Yzw,;, then N(X) = xe + tori + 23/4 + fi(ai/2, xe, 23) 
= P(x , 21, Xe, X73), Where M is the matrix of 2. 

We may set 1 = #, i = wii,j = 6 ie, ij = (a8)'i3, where the i; are the 
hamiltonian quaternion units. Then wo, = Lxleat, where the ?’s are real. 
Since the determinant of N(Zz,w,) is d’/16, it follows that, after changing the 
sign of ws; if necessary, d = 4 | tin |. 

Feuter considered an integral set in a quaternion algebra with the funda- 
mental number d."° If w) = 1, --- ,w3 form a basis of this set, he determined 
explicitly the integers c;;, defined by ww; = xc; jw, in terms of the coefficients 
of the form N(22w;) = }$29;;r;2; and the qg’s defined by certain relations be- 
tween the w’s. Since his w’s form a basis of an integral set, it follows from his 
definitions of the g’s that they are integers. 

Consider Fueter’s argument, beginning on p. 651 and leading to equations 
(11) on p. 654, as applied to the present w’s. The argument holds without 
modification except that the definitions of the g’s now imply only that they are 
rational. However, since they satisfy his equations (7), p. 653, it follows that 
every dg;; is equal to the cofactor of one of the elements of M. Since M satisfies 
(c) of Theorem 1, it follows that every g;; is an integer. Hence by Fueter’s 
equations (11), p. 654, ww; = Lej;,, where each c is either an integer or half 
an odd integer. By a result due to Brandt, the c’s are all integral.” 

Let @ be the set of all elements XY = Sxr,w; with integral z’s. By definition 


1 Zur Theorie der Brandtschen Quaternionenalgebren, Mathematische Annalen, vol. 110 
(1934-5), pp. 651-4. 

\7 Der Kompositionsbegriff bei den quaterndren quadratischen Formen, Mathematische 
Annalen, vol. 91 (1924), p. 303. 
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of the w’s, T(X) = 2%) + 2. Since N(X) = (a, --- , 23), it follows that G 
has the property # used in our definition of an integral set. By the preceding 
paragraph, @ has the property C. It obviously has the property U. Let G’ 
be an integral set in &’ containing G, i.e., a maximal set with the properties 
R, C, U and containing @. The norm of the general element in @’ is a form 
&’ and by Theorem 1, the determinant of 26’ isd”. Since ’ is transformed into 
® by an integral transformation, it follows that d’ divides d. But we have seen 
that d, divides d’. Hence d’ = d, ord’ = d. Since every positive fundamental 
number is the product of an odd number of prime factors, it follows that d’ = d 
and hence %’ is equivalent to %."° We may therefore identify %’ with 9%. 
Then @’ = G@ is an integral set in A. Since N[x(2e: — 1) + row, + raw] = 
fi(ai, 22, 23), it follows that € is the class of forms corresponding to the class of 
integral sets containing @. This proves the theorem for the case where d 
is even. 

Suppose d is odd. Let € be a class of positive improperly primitive forms in 
G and let 2f = aj;xz; be a form in ©. Since | a;;| = 2d’, one of the a;;, 
Say G23, is odd. If ay is odd, replace x; by 2; + 23 and if aj; is odd, replace 
re by 41 + 2. We may then assume ay: = a)3 = d23 + 1 = 0 (mod 2). Every 
term in the expansion of | a;; | is divisible by 8 except —ay,a3;. Hence a, = —2 
(mod 8). Let gi;, g:; be the integers defined by equating the matrix T' of §3 
to the matrix (a;;/2) of f, let go = 2922, J33 = 2933 and let M be the matrix of §2 
with the elements g;; as thus defined. Since the invariants of 2f are Q = d, 
A = 2, it may be shown that M = (g;;) satisfies all the conditions of Theorem 1. 

We may then show, in the same manner as for the case where d is even, that 
there is an integral set G in A with a normal basis wo, --- , ws such that N(2z,w,) 
= (x, --- , 23), where M is the matrix of 26 and N[x,(2a; — 1) + xewe + ryws] 
= f(xi, x2, 73). Then € is the class of forms corresponding to the class of 
integral sets containing @ and the theorem is proved. 


5. A basis of an integral set. Let F be a quadratic field with the discriminant 
—r, where 7 is a prime not a divisor of d. By a result previously referred to, A 
contains a field equivalent to F if and only if no prime factor of d is the product 
of two prime ideals in F. This condition is satisfied if and only if — 7 is a quad- 
ratic non-residue of every odd prime factor of d and r = 3 (mod 8) if d is even. 
It is known that for every such field, called a canonical splitting field, % has a 
basis 1, i, j, ij, where i = —7, 7° = —d, ij = —ji." Such a basis of % will be 
called a canonical basis. 

Let @ be an integral set in % and let F be a canonical splitting field of % 
with the discriminant —r. The intersection of & and F is a ring with a basis 1, 
cw, where 1, w form a basis of all the integral elements of F and c is a positive 
integer. Hull determined a basis v, v;, v2, v3 of @ such that m = 1, 1, = cw.” 


18 Brandt, loc. cit., pp. 12, 13. 
19 This Journal, vol. 1 (1935), p. 435. 
20 Transactions of the American Mathematical Society, vol. 40 (1936), p. 8. 
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c is not in general equal to unity. We shall show that there is an infinitude of 
canonical splitting fields such that for each of them, ¢ = 1." Choosing F as 
such a field and setting c = 1 in the v;, we obtain from them a relatively simple 
normal basis of G. 

Let wo, «++ ,@3 be anormal basis of G and let f(x, x2, x3) be as defined in §3. 
We shall first prove 

Lema 2. f(a:, 272, 2x3) represents an infinitude of primes. 

Let d’ = d/2 and f'(a1, x2, 23) = f(a1, 22, 23) or let d’ = d, f’(x1, x2, 23) = 
f(x1, 22, 2x3) according as d is even or odd. If d is even, we have seen that the 
invariants of f’ are Q = d’, A = 1 and the reciprocal y’ of f’ is properly primitive. 
It may be shown that f’ and y’ have these properties if d is odd. 

For properly chosen integers C;, ¥/(Ci, C2, C3) = C > 0, where C is prime 
tod’.” After adding properly chosen multiples of d’ to the C;, we may assume 
C, = 0, C2 = C; = +1 (mod 4). Finally, without disturbing the preceding 
conditions, we may assume that the g.c.d. of the C; is unity. Let aq = 
(Cz + C3)/6, ag = as = —C,/6, where 6 is the g.c.d. of C; and C, + C;. Then 
a; is odd, az and a; are even, the g.c.d. of the a; is unity and Ya;C; = 0. We 
may then determine integers b;, such that” 

| Ge be ay b; | ay, b; 
= C; = —(C, = C3. 
a; bs , a3 bs de be 

In f'(21, 22, Zs), set 2) = ay, + bye (¢ = 1, 2,3). We obtain a primitive 
form $(y:, y2) = ay, + 2tyye + by?, where ab — t = Cd'.™ We havea = 
f'(a,, a2, a3). Since the leading coefficient of f’ is = 3 (mod 4) and a, is odd, az 
and a; are even, it follows that a = 3 (mod 4). Then $(y:, 2y2) is a properly 
primitive form whose discriminant is not a perfect square. Hence it represents 
an infinitude of primes. Since a2, a3 are even, the lemma follows. 

Let + be one of the infinitude of primes represented by f(x1, 272, 2x3) and 
not a divisor of d. Then there is an element ¢ = 2;(20; — 1) + 2rewe + 27s 


in ¥ such that? = —r. Since 2, is odd, @ contains \y = (1 + i)/2. The field 
F(7) is a canonical splitting field of % and % has a basis 1, 7, J, 77, where 7 = —d, 
y= —jf. 


Identify Hull’s Q, M, p, u, P, ac” with our A, G, 7, 7, F(i), —d respectively. 
Then @ has a basis in the form below,” if we note that @ contains d,, and hence 
Hull’s c = 1. 


m=1, mn = (—7+1)/2, wm=gA4+j), uw =gA4+)) E 1) + ‘| 
27q 

2. Cf. Hull, loc. cit., p. 11, lines 9-13. 

22 Dickson, Studies, p. 14, Theorem 16. 

23 Dickson, Studies, p. 11, Theorem 9. 

* Dickson, Studies, Theorem 27, p. 25 and Theorem 37, p. 32. 

% Loc. cit., Theorem 6, p. 7. 

6 Hull, loc. cit., p. 8 
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Employing Hull’s equations (16) and (17), p. 7, loc. cit., and setting b = 1 — 2h, 
we find 


P kigs m b. l as 
’ —_— “o / _—_—.— — — — 
v; = ko(1 + bt) /2 + i 3g! org ij. 


Since b is odd and ga is an integral element in F(7), it follows that G has a basis 
do, *** ,A3 as below. N(A;) is an integer and therefore the congruences (5) 
are satisfied. 

To prove the last sentence of the following theorem, let @ be the set of all 
elements Y2,A; with integral x’s. It may be verified that G has the properties 
R, C, U used in our definition of an integral set. Since the form @ = N(2z,A,) 
is obtained from N(x + yi + 2j + wij) by a transformation of determinant 
(47), it follows that the determinant of 2@ is d*. Hence @ is maximal. We 
have then 

TueoreM 4. Let G be an integral set in A. There is an infinitude of primes r 
such that for every + there is an clement i in ® such that (a)? = —r, (b) the field 
F(¢) is a canonical splitting field of A, (c) G contains (1 + 7)/2. For every such 
i, there is a canonical basis 1, 7, j, tj of A and G@ has a basis in the form 

b 1 


f “ ° G's ° ain 
(4) A =1, du = (1 + 2)/2, Ae = gj, As = a + 2g! ~ Org t); 


where a,b, g > 0 are integers such that 


(5) 4a°g° + d = 0 (mod 7), rb’ + 1 = 0 (mod 4g’). 


Conversely, if 1, i, j, ij form a canonical basis of A, * = —r, and if a, b, g are 
integers which satisfy (5), then \o, «++ , \s of (4) form a normal basis of an integral 
set. 

We shall show by an example that a canonical splitting field F(?) cannot 
always be chosen so that an integral set has a basis in the form (4) with g = 1. 
Let % be the algebra with a basis 1, J, J, IJ, where I’ = —83, J* ="—10, IJ = 
—JI. The fundamental number of % is d = 2-5-83.% It may be verified 
that the following elements form a basis of an integral set © in YM. 


wo = 1, w, = $(1+ J), w = 3J, a = 1-5 + Pi. 


It may also be verified that @ contains no element of trace zero and norm d. 
Hence there is no canonical splitting field F(/) such that @ has a basis in the 
form (4) with g = 1. 


6. Proof of Lemma 1. Let \o, --- , As of (4) be a basis of @. Since 2A, — 1 
= tis in G,, the intersection of @, with the set of all integral elements of F(:) 
is a ring with the conductor ¢ = lore = 2. Ife = 1, G, contains \, and, since 
T(\2) = TAs) = 0, OG, also contains As, \3. Hence @, contains @ and the 
lemma follows. We shall show that the case ¢ = 2 cannot occur. 


* This Journal, vol. 1 (1935), p. 435. 
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Suppose c = 2. By Hull’s result,” G, has a basis in the form 


th—7r+ ‘) 
29 ‘ 








a er ee ee a +9 


where X is a certain element in F(z) and g; is a positive integer. If fo, --- , 
is a normal basis of @,, the trace of each of the elements 2¢,; — 1, ¢2, &3 is zero. 
It follows that the double of every element of @, is in G. In particular, 2v2 is 
in @. Hence g divides g,. Since 2 is in @,, g: divides 2g. But by Hull’s 
Theorem 9 (loc. cit., p. 10), g: isodd. Hence g = g:. By Hull’s Theorems 7, 8, 
the lemma on p. 9 and (16) on p. 7, d and hk are odd. Then in his (17), ki = 
ke + 1 (mod 2) and we find 
mai $ Gi, n— Bara (wt 2g hott, 

where z, w are certain rational numbers. Since k; or ke is even and T(A3) = 0, 
the right member of one of these equations is in @ and hence is expressible as a 
linear function of Ao, A1, Ax with integral coefficients. But this is obviously 
impossible in the first case and, since A and b are odd, it is impossible in the 
second case. Therefore c # 2 and the lemma follows. 


7. Certain integral sets. Let J, J be elements of % such that 
(a) 1, J, J, JJ form a basis of YA; 

(b) I’ = —a, J? = —8, IJ = —JI, where a, 8 are integers, neither divisible 
by a square > 1 and a = B (mod 2). 

All the integral sets, finite in number, which contain J and J were determined 
by Darkow™ and the writer,” the former treating the case where a and 8 are 
even. Of course, a and 6 are not uniquely determined by &. Let G be an 
arbitrarily chosen integral set in &%. The question arises as to whether or not 
@ may be obtained from the above mentioned results by proper choice of I 
and J, i.e., whether or not @ contains elements J and J which satisfy the condi- 
tions (a) and (b). We shall show that it does contain such elements. 

Let Xo, «++ , A3 Of Theorem 4 be a basis of G@. G contains i = 2A, — 1, 


J = —ari + yo + rrr; = (rhx + 2g’y + xi)j/2g 


and 7.J, where z and y are rational integers to be determined later. We have 
J* = —do(z, y), where $(z, y) = rex” + rbry + g’y’, c = (rb? + 1)/4g°. The 


28 Loc. cit., p. 8. 

29 Determination of a basis for the integral elements of certain generalized quaternion 
algebras, Annals of Mathematics, vol. 28 (1926-27), pp. 263-270. 

30 Arithmetics of generalized quaternion algebras, American Journal of Mathematics, 
vol. 48 (1926), pp. 57-63. 
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discriminant of ¢ is —7 and hence it represents an infinitude of primes. There- 
fore, for properly chosen integers z and y, J* = —dp, where p is a prime not a 
divisor of 2dr. Then dp has no square factor > 1 and is prime to r. It may be 
shown that 1, 7, J, iJ are linearly independent. Hence they form a basis of YI; 
furthermore, 7J = —J1?. 

Let I = ior] = iJ according as d is odd or even. Then J and J satisfy 
all the conditions (a) and (b). Since @ contains J and J, it follows that it is 
one of the sets obtained by properly specializing the above mentioned results 
by Darkow or by the writer, according as d is even or odd. 


UNIVERSITY OF KENTUCKY. 








THE ERGODIC FUNCTION OF BIRKHOFF 


By Monroe H. Martin 


Introduction. In his New Orleans lecture, Professor Birkhoff' introduced 
the concept of the ergodic function Te) as the least time T which elapses before 
the point P of some motion can come within a distance ¢ of every point of the 
phase space. He is led to conjecture that in the general closed recurrent case 
possessing no stable periodic motions the ergodic function is of the order of 
e ‘"-» n being the number of dimensions of the phase space. The purpose of 
this paper is to consider the closed, transitive dynamical systems provided 
by the geodesics on topologically closed surfaces of constant negative curvature. 
The principal result is the establishment of upper bounds for the ergodic func- 
tions of these dynamical systems. 

The metric chosen for the phase space is patterned after that used by Morse.” 
The “time” T along a motion is taken as the “H-length” measured along a 
geodesic on the surface (see §9 of the present paper). 

If « be chosen in compliance with (67), (68), we find that the ergodic function 
T(e) satisfies an equality of the form 


T(e) << €° |4 log + o|, 


where A, B, C and w depend on the genus p of the surface as set forth in (77), 
(78), (79) and (80). All of these constants tend to + « for p— + ~, the con- 
stant w, in particular, being bounded by the inequalities (81), so thatw > 4 > 2 
for p = 2,3,---. Since n — 1 = 2 in the case under consideration, the order 
of T(e) as conjectured by Birkhoff lies well below that of the upper bound we 
have found. 

It may not be amiss to point out that the upper bound we have secured can 
doubtless be sharpened considerably, for in deriving it we compute an upper 
bound for the magnitude of the interval of time during which a point P of a 
certain motion comes within a distance ¢e of every point of the phase space at 
least once, the point P perchance coming repeatedly within a distance ¢ of some 
or all of the points of the phase space. In addition, the upper bound, (4p)°”, 
given in §10 for the number of sets S%, can in all probability be improved upon, 
with a resultant improvement in the upper bound for the ergodic function. 


Received August 23, 1935. This paper had its inception while the author was National 
Research Fellow at Harvard University in 1933. 

1G. D. Birkhoff, Bull. Amer. Math. Soc., vol. 38 (1932), pp. 375-377. 

2M. Morse, Jour. de Math., vol. 14 (1935), pp. 52-53. This paper will be referred to as 
Morse I. 
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1. Preliminary notions. Let (x,y) denote ordinary Cartesian coérdinates 
in the plane. Let W denote the unit circle z° + y’ = 1, and let the interior of ¥, 
provided with the metric 


(H) ds? = 4(1 — 2” — y’) “(dx + dy’), 


be denoted by #. The Gaussian curvature of this metric equals —1, so that 
may be regarded as the hyperbolic plane of non-euclidean geometry. Linear 
fractional transformations of hyperbolic type on the complex variable x + ty 
which take the interior of into itself leave (H) invariant, and are called H- 
transformations (hyperbolic transformations). The geodesics on # are circular 
ares orthogonal to VW and ending upon it. They are called H-lines. Through 
two points on ® only one H-line can be drawn. Consequently if two H-lines 
intersect in more than one point, they coincide. Two H-lines which have no 
common point either on ® or on W are said to be non-intersecting. A segment 
of an H-line which has both end-points A, B in @ is an H-line segment, and is 
denoted by AB; a segment which has one end-point in @ and one on ¥ is an 
H-ray. The H-line of which a given H-line segment is a segment is, at times, 
conveniently referred to as the H-line segment produced. If A, B are two dis- 
tinct points on ¥(#), there is exactly one H-line (H-line segment) joining them. 
On directing this H-line (H-line segment) from A to B, we obtain a directed 
H-line AB (directed H-line segment AB) for which A is the a-end-point and B 
the w-end-point. If an H-line (H-line segment) and a segment of itself are 
directed in the same sense, the (latter) directed H-line segment is a segment 
of the (former) directed H-line (directed H-line segment). A directed H-line 
(directed H-line segment) intersects a set of directed H-line segments if one 
of the directed H-line segments in the set is a segment of the directed H-line 
(directed H-line segment). A set S,; of directed H-line segments is a subset 
of a set S of directed H-line segments (directed H-lines) if every member of S, 
is a segment of some member of S. 

The length of an H-line segment AB calculated in the metric (H) is its H- 
length, and is denoted by | AB|. The H-distance between two points A, B of ® 
is then defined to be | AB |. When an H-line segment is designated by a single 
letter, say n, its H-length is designated’ by | n|. The H-mid-point of an H-line 
segment is defined to be that point of the H-line segment which divides it into 
two H-line segments of equal H-lengths. Angular magnitudes measured in 
the metric (H) equal the ordinary Euclidean magnitudes. No distinction will 
therefore be made between the two. If A is a point not lying on a given H-line 
Land B the point of 1 such that AB meets I at right angles, AB is the H-per- 
pendicular let fall from A tol. | AB! is the H-distance between A and Il and is, 
as a matter of fact, the minimum H-distance between A and the points of I. 
If two H-lines 1, m are non-intersecting, they possess a unique mutual H- 





3 At times, we also employ the symbol | | in its customary usage, that of denoting abso- 
lute values. No confusion need arise, since the proper interpretation will be clear from 
the context. 
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perpendicular, the H-length of which is denoted by | lm |, and is called the H- 
distance between land m. |lm| is then the minimum H-distance between the 
points of l and m. 

If A, B, C are three non-coincident points of such that two of the H-line 
segments AB, BC, CA meet at right angles, say BC and CA, the following rela- 
tions* hold: 


sinh | AC | 
sinh | AB|’ 


AC | cosh | BC], 


where A denotes the angle CAB. If A, B, C are three arbitrary points of ®, 
the following “triangle” inequality holds: 


iio sinh | BC | 
(1) ~ sinh | AB|’ 


cosh | AB| = cosh 


cos A = cosh | BC | 





| AB| = | BC| + {CA}. 


2. Gaussian geodesic, and geodesic polar codrdinates. Let n be a given 
directed H-line segment. By a Gaussian geodesic coérdinate system [u, , v,] on ®, 
we mean a codrdinate system on © in which the coérdinate lines v, = const. 
are H-lines orthogonal to the directed H-line of which n is a segment, and the 
coérdinate lines u, = const. are the orthogonal trajectories of the coérdinate 
lines v, = const. The origin [0, 0] is taken as the a-end-point of n. The posi- 
tive sense on u, = 0 is chosen to coincide with that on n and the sense on v, = 
const. is then affixed in the usual way, i.e., so that an observer moving in the 
positive sense along u, = 0 finds the positive sense of v, = const. directed to 
his right. A unique coérdinate system [u, , v,] is thereby associated with any 
directed H-line segment n. The coérdinate lines u, = const. are circular arcs 
in ® which connect the end-points of u, = 0. They cut off equal H-lengths 
upon the codrdinate lines v, = const. If we take u, as the H-length of are 
measured along v, = const. from u, = 0, and v, as the H-length of are measured 
along u, = 0 from [0, 0], the differential form (H) becomes’ 


(G) ds’ = du, + cosh’u,dv’*, 
in the [u,, v,] codrdinate system. Hence the H-length «x of the segment of 
u, = k comprehended between v, = 0 and v, = | n | is given by 
(2) k= [ cosh kdv, = | n| cosh k. 
0 


Let P, P’ be two points of u. = a. Denote by o the H-length of the segment 
PP’ of u, = a. Let a, a’ denote the directions (measured in the usual way in 


‘See, for example, H. S. Carslaw, The Elements of Non-Euclidean Plane Geometry and 
Trigonometry, London, 1916, p. 109. 

5 See, for example, W. Blashke, Vorlesungen wber Differentialgeometrie, vol. 1, 1930 
pp. 155-156. 
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the x, y-plane) of the coérdinate lines v, = const. at P, P’ respectively. We 
proceed to calculate an upper bound for | a — a’ | when P, P’ both lie in the 
region 


of @. Since the coérdinate lines v, = const. are orthogonal to u, = a, it is 
sufficient to obtain an upper bound for the difference | 8 — 8’ | in the directions 
8, B’ of u, = a at P and P’ respectively. 

Let us assume (with no loss of generality) that u, = 0 is drawn orthogonal 
to the positive z-axis of the x, y-coérdinate system at a point which lies at an 
H-distance ~ from the origin of the x, y-codrdinate system. The equation of 
u, = @in 2, y-codrdinates is then 


2 cosh & sinh € + sinha _ 
sinh a — sinh & sinh é — sinha — 





ry + 
Letting K denote the ordinary Euclidean curvature of this circle, we find 


|sinh a — sinh €| 





[ai= cosh a 
Now 
\dp! \dp| |\ds’| ,, |ds’| 
\ae | ™ |ae| |ae | = !*! a} 





where s, s’ denote the H-length and Euclidean length of are respectively, meas- 
ured along u, = a. From (H), we have | ds’/ds| < 3, so that |d8/ds| < 
3|K |. Therefore 


a-a'ls | |Bla<4iKlo 


and hence we have from (2), since ¢ < | n | cosh a, 


— | n | 
|B B’| <> 





sinh a — sinh €|, 


| K | being replaced by its value given above. Since —5 S a S 6, —§ = O and 
ja — a’| = |B — B’ |, we find that 


(3) la-a'’|< > (sinh 6 + sinh £), 


which is the required upper bound. 
The set of H-rays concurring at a point O of combined with their orthogonal 
trajectories form a geodesic polar coérdinate system [r, ¥] on ®. ris the H-length 
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measured along an H-ray from O, and y is the angle at O measured from a fixed 
direction through 0. In these codrdinates (H) becomes’ 


(P) ds’ = dr’ + sinh r'dy’. 


The point O is called the pole. 

A coérdinate line r = const. will be called an H-circle and O is its H-center. 
The H-line segments connecting the points of an H-circle to its H-center are its 
H-radii. The two halves into which an H-circle is divided by an H-line through 
its H-center are H-semicircles. An H-circle is an ordinary Euclidean circle. 
However, its Euclidean center coincides with its H-center only when the H-center 
lies at the center of WV. 


3. Preliminary lemmas. In this section we prove a number of elementary 
lemmas. 


Lemma 1. If BC and AD are two H-line segments perpendicular to the H-line 
segment CD at its end-points, and if the H-line segment AB meets BC at right 
angles,’ then 


(4) sinh | AB | 
(5) cosh | CD | 


sinh | CD | cosh | AD |, 
sin Z BAD cosh | AB |. 


Draw the H-line segment AC. (4) follows from (1) and the relation cos Z 
ACD = sin Z ACB. 

From (1) we have cosh | AC | = cosh | AB| cosh | BC | = cosh | AD | cosh 
(CD |, and in addition, since sin Z ACD = cos Z ACB, there results sinh 
| AD| = cosh | AB| sinh | BC |. (5) follows when these relations are used 
together with (1) and (4) to simplify the expression for sin Z BAD, obtained 
when sin Z BAD is expressed in terms of sines and cosines of Z CAD and 
Z CAB. 

Let n be a given directed H-line segment and [u,, v,] be the associated Gaus- 
sian geodesic coérdinate system. 4, p denote positive numbers and C;, Cy 
are two H-semicircles of H-radius p drawn in the region of ® for which v, 2 0 
about the points [+(A: + p), 0] as H-centers. 

Lemma 2. If C is an H-semicircle of H-radius p drawn in the region v, 2 0 


of ® about a point on the segment —(\y + p) S Un S A. + pofv, = 0 as H-center, 





V 


it is intersected by any H-line | which intersects both Cy, Cy. 
First, suppose [0, 0] coincides with the center of ¥. The codrdinate lines 
u, = 0,v, = 0 are then perpendicular diameters of ¥. Consider the H-line t, 


6 See, for example, L. P. Eisenhart, A Treatise on the Differential Geometry of Curves and 
Surfaces, New York, 1909, pp. 207-209. Formula (P) above is not given explicitly here, 
but may be readily obtained from the formulas to be found on the pages indicated. Asa 
matter of fact, (P) is a well-known formula for the are length in non-Euclidean geometry. 

7 The H-line segments BC, AD, CD, AB form a figure which is sometimes called the tri- 
rectangular quadrilateral. See, for example, D. M. Y. Sommerville, Non-Euclidean Geom- 
elry, Chieago, 1919, p. 70. 
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drawn tangent to both ce. Cy. In order to prove the lemma, it is sufficient to 
prove it when I coincides with ¢,;. The H-distance from a point of the segment 
—(A. + p) S uw S i + p Of v, = O to t, is seen from (4) to be a monotone 
increasing function of its H-distance from [0, 0], and as such takes its greatest 
value, namely p, at the end-points of this segment. Since the H-center of C 
lies on this segment, C meets t, . 

When [0, 0] does not coincide with the center of V, an H-transformation 
carrying [0, 0] into the center of VW reduces this case to the one above. 

Lemma 3. If C’, C” are two H-semicircles of H-radii p drawn in the region 
v, 2 0 of ® about points on the segment —(\. + p) S ua S AL + pofv, = Vas 
H-centers, the set L of H-lines intersecting C’, C’’ contains the set L, of H-lines 
intersecting two arbitrarily chosen H-radii of C ep 

An H-line of L, intersects both C;, CY’. According to Lemma 1, it intersects 
both C’, C’”’ and consequently belongs to L. 

Lemma 4. The interior of the region of ® enclosed by v, = 0, two arbitrary radit 
of Ci, Cy, and the H-line segment connecting their end-points on Ci, CY cannot 
contain an H-circle of H-radius p/2. 

The proof of this lemma is simple and is omitted. 

Lemma 5. If C’, C” are two H-semicircles of H-radius p drawn in the region 


v, = 0 of ® about points [+(A + p), 0] (A > 0) as H-centers, and if n is such that 
v, = | n| ts tangent to both C’, C’’, we have | n | bounded by the inequalities 
(6) pe” < |n| < 2e-™ sinh p. 


Fixing our attention on one of the H-semicircles, we let A denote its H-center, 
B its point of tangency with v, = ||, C the point [0, | n |], and D the point 
{[0, 0]. According to (4), we have 
sinh p = sinh | n | cosh (A + p), 
since | AB| = p,|CD| = |n|,|AD| = X + 1, and the upper bound in (6) 
follows at once from this equation, since 
sinh |n| > |n|, cosh (A + p) > }e°*??, 
To obtain the lower bound in (6), consider the segment of u, = + p taken 
between v, = Oandv, = |n|. According to (2), the H-length of this segment 
is |n| cosh (A + p). Now 
p = | AB| <|n| cosh (A + p), 
since AB is the H-perpendicular let fall from A tov, = |n|. The lower bound 
given in (6) is then obtained from this inequality, inasmuch as | n | cosh (A + p) 
<|n|e*. 
Lemma 6. When the H-line segment AB lies in the region 
Tet 0 <u, S 7, 0s, S/n} 
of and A lies on u, = 0, we have 
|AB| <|n|+r. 
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A 


If AB lies in the region —p S un S 7,0 Sv, S | n|, with A onu, = —pand 
Bin T,, , we have 


|AB|<|n|+utr. 


For the first part of the lemma, take C on u, = 0, so that its v,-codérdinate 
equals that of B. Now 


|AB| < |AC|+|CB| s|n| +7, 


which prove» ‘his part of the lemma, inasmuch as the equality signs cannot hold 
simultaneously. 

To prove the second part of the lemma, we take C as before and again have 
|AB| s | AC | + | CB), and since | AC | < |n| + 4, |CB| S r+, the second 
part of the lemma is proved. 

Lemma 7. When sinh (| n|/2) < tanh 6, an H-circle of H-radius 5 can be 
inscribed in the region T,, of Lemma 6 if r = 1, , where 
4 sinh 6 

[m| 
Let A be a point on the positive half of v, = |n|/2, B be the foot of the 


H-perpendicular let fall from A to v, = |n|, C be the point [0, | n |], and D 
be the point [0, | n |/2]. An H-circle of H-radius | AB| described about A 


(7) Tr, = 6+ log 








as an H-center is tangent to both® », = 0, v, = | | and will, in addition, be 
tangent tou, = Oif |AD| = | AB|. If, however, |AD| > | AB|, the H- 
circle lies in the region 

(8) 0<u, S|AB|+|AD\|, 0s», S\|n\, 

of . 


From (4) we have 
sinh | AB| = sinh (| n |/2) cosh | AD | < tanh 6 cosh | AD |, 


and therefore the condition that | AD| > | AB| is met, when | AB| = 6. By 
way of obtaining an upper bound for | AD |, we have 


sinh 6 








h|AD| = —— = 
oath | ‘sinh (| {/2)’ 
and therefore 
sinh 6 + [sinh’ 5 — sinh? (| n | /2)}" 
AD| =1 ; ’ 
naa og { sinh (|n|/2) 
so that 
2 sinh 6 4 sinh 6 
AD log ——— x < log ——— 
|AD| < log oh (in/2) <8 Ta 
8 To show that it is tangent to v, = 0, let B’ be the foot of the H-perpendicular let fall 
from A to v, = 0, C’ be the point [0, 0]. From (4) we have, since | C’D | = | CD |, the 


result sinh | AB’ | = sinh | C’D |-cosh | AD | = sinh | CD | cosh | AD | = sinh | AB |. 
Hence | AB’ | = | AB | , which completes the proof. 
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The upper bound in the first inequality in (8) may then be replaced by 6 + log 
4 sinh 6 
i 

Lemma 8. The H-length of the mutual H-perpendicular n between two non- 
intersecting H-lines l, , l. drawn tangent to an H-circle T of H-radius y is given by 





This proves the property stated for 7, in the lemma. 


(9) |n| = 2 log {sin @ cosh y + (sin’@ cosh*y — 1)'}, 


where 26 [are sin (sech y) < 6 S 1/2] is the angle between the H-radii of Y drawn 
to the points of tangency. 

Let A be the H-center of T, B be the point of tangency of I with 1, , C the 
end-point of n on l,, and D the H-mid-point of n. From (5) we have 


cosh (| n |/2) = sin @ cosh y, 


since |CD| = |n|/2, ZBAD = 6,| AB| = y, and (9) follows from this equation. 
When 0 S @ S arc sin (sech y), the two H-lines |, , l, intersect. 
Lemma 9. Let a, a2, b;, be be four H-lines each making an angle 6 (—72/2 < 
6 < 2/2) with an H-line segment AB. Let those H-lines labeled with a intersect 
at A, and those labeled with b intersect at B. Choose @ so that no line labeled with 
a intersects a line labeled with b. The four H-distances 


| dibs |, | dibs |, | aaby |, | debe | 


resolve into two pairs such that the H-distances in each pair are equal, the greater 
H-distances occurring for the pair in which the H-lines are placed so that their 
mutual H-perpendicular intersects AB. 

The H-distances from the H-mid-point O of AB to a,, a2, b; , be , are all equal 
to one another, so that these four H-lines are all tangent to the same H-circle 
with H-center at O. The proof of this lemma is then readily seen to follow from 
(9) in Lemma 8. 


4. The domain D,. Let p denote a positive integer greater than unity and 
let p, 5 (0 < p < 8) be defined by’ 


(10) cosh 2 = cot — , 
2 P 
(11) cosh ° =o ot —. 
2 4p 
In ® construct the H-circles 
(12) ry = tanh’, 
(13) 2 +y° = tanh’ i. 


® I am indebted to the referee for the elegant form of these equations. 
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having H-radii p/2 and 6/2 respectively, and beginning with the point on (12) 
where it intersects the positive z-axis, divide the circumference of (12) into 4p 
equal ares. At each of the 4p division points draw an H-line tangent to (12), 
thereby forming a curvilinear polygon” having 4p sides and 4p vertices. We 
denote the interior of this curvilinear polygon by Do and label its sides 


(14) €1,@2,°°* 5 Cap, 


¢e; being taken as that side of Dp which is tangent to (12) where it cuts the posi- 
tive z-axis, and the subscripts increasing when the boundary of Dp is traversed 
counter-clockwise. The vertices of Do are labeled 


(15) Va ¥05 *** 5 Ves 


V, being the vertex at which e; and e: concur, and the subscripts increasing 
when the boundary of Do is traversed counter-clockwise. The vertices of Dy 
all lie on (13). The H-cirele (12) is inscribed in Do , which in turn is inscribed 
in (13). The H-length of each side of Do equals p and the interior angle at 
each vertex equals +/2p. 

V;and V;_, (Vo = V4,) each divide the side e; of Dp produced into two H-rays. 
The H-ray ending on V; (V;_;) which does not contain the side e; of Do is de- 
noted by e; (e;). It is assumed that V; (V;-1) is a point of e; (e;). The region 
of & lying outside of (13) is divided into two subregions by e741 , e}-1 (@spaa = @1, 
€) = ¢). The subregion bounded on W by the shorter (Euclidean) are when 
taken with its boundary is denoted by £;. The region of #, lying outside of 
(13) and between E; and Fj.; (Ey,.; = E;) at V; , when taken with its boundary 
is denoted by E;. With respect to the region E; , we prove the following lemma. 

LemMMA 10. The boundaries ej. , e;-. of E; cannot be joined by an H-line seg- 
ment lying in E; . 

The proof of this lemma is elementary, but because of the importance of the 
lemma in subsequent arguments the proof is given in some detail. From sym- 
metry considerations it is clear that we need consider only the case 7 = 1. 

In the x, y-coérdinate system the equation of the circle containing the side 
e2 of Do as a segment is 


ry —-2 4 cos o cos —- x — 2 cos — VV sec — sin— y +1 =0. 

2p 4p 4p 2p 2p 
The minimum value of y on this circle is caleulated to be sin r/(4p) cos /(2p), 
occurring, as a matter of fact, at V;. Consider the H-line | drawn tangent to 
(13) at the point where it cuts the positive z-axis. | is a segment of the circle 
whose equation is 


Tv 


. e a 
rty — 4/sec —(1 + cos = 0, 
r+y V seo = ( L co z)e41 


1° That the H-lines so constructed intersect in pairs is well known. This fact may 
also be drawn from (10) in connection with the remark at the end of Lemma 8 [ Referee}. 
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and lies exterior to (13). The maximum value of y on l is tan'r/(4p) and it is 
readily seen that 





tan’ 2 < sin £ 4/005 ¥ (p = 2,3, ---), 
so that | cannot intersect the side e2 of Dy produced, the situation being analogous 
in regard to the side e,, of Do. J therefore lies in E, . 

Suppose an H-line segment lies in EZ, and connects e2, ei). Since 1 connects 
(13) to W, this H-line segment meets l. It cannot, however, be a segment of l 
and therefore when produced intersects Y in two points, one in each of the ares 
into which WV is divided by the end-points of 1, so that it cannot intersect both 
€2, €i», Which is a contradiction. The lemma is therefore true. 

5. The group G and the net N. When certain of the points on the boundary 
of Dy are adjoined to Dy , a fundamental domain Dy is obtained for a well-known" 
Fuchsian group G. Transformations of G which take the sides of Do into one 
another in the following manner: 


€y — €3, C4 — C2, C5 —> C7, Cg —> Op, -+* 5 Cap_3 — Cap-_1, Cap — Cape, 


form a set of generators for G. The transformations of G are all H-trans- 
formations. , 

Two point sets in ®, or two sets of directed H-lines, or two sets of directed 
H-line segments which can be transformed into one another by transformations 
of G are congruent, or copies of one another; two symbols forming a pair above 
are congruent symbols. The copies of Do are meshes and their totality covers 
® without lacunae. The copies of the sides (vertices) of Do are the sides (vertices) 
of the meshes. Two meshes with a side in common border upon the common 
side. Those meshes which border Dy upon e:, é2,--- ,@s» are denoted by 
D,, Dz, --- , Dy respectively. Two vertices which are end-points of the same 
side of a mesh are adjacent vertices. Two sides of a mesh concurring at a vertex 
of the mesh are adjacent sides. Two sides of a mesh which are not adjacent 
sides are non-adjacent sides. Two non-adjacent sides of a mesh adjacent to 
the same side of the mesh are alternating sides. Alternating sides, as well as 
adjacent sides, always belong to the same mesh. When two meshes border 
upon a common side, this common side is adjacent to two pairs of alternating 
sides, one pair being found in each mesh. If a side be selected from each of these 
pairs of alternating sides so that the two sides selected do not concur, the two 
sides thus selected are opposite sides. Opposite sides do not belong to the same 
mesh but belong to meshes which have a side incommon. Neither non-adjacent 
sides nor opposite sides intersect, even when produced. 

The H-length of each side of a mesh equals p. Any given mesh is inscribed 


11 See, for example, M. Morse, Trans. Amer. Math. Soc., vol. 26 (1924), pp. 25-32. This 
paper will be referred to as Morse Il. See also, J. Nielsen, Acta. Math., vol. 50 (1927), 
pp. 191-224. 
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in an H-circle of H-radius 6/2 and its 4p sides are all tangent to an H-circle of 
H-radius p/2. The interior angle at each vertex of the mesh equals +/2p. 

The sides of the meshes form a network N of H-line segments. The end- 
points of the sides in N are the vertices of N. The sides in N align themselves 
into H-lines, 2p of these H-lines concurring at each vertex of N. A vertex of N 
therefore serves as a vertex for 4p meshes. We label each side of a mesh with 
the label assigned in (14) to that side of Dp into which it is carried by the trans- 
formation of G carrying the mesh into Dy. To each side of a mesh there is then 
assigned two of the symbols in (14), 6ne coming from each of the meshes border- 
ing upon the side in question. N, together with the labeling of its sides, is 
transformed into itself by a transformation of G. 

The two points in which the side e; of Do produced intersects V are the base- 
points of e;. The base-points of e; divide V into two ares of unequal euclidean 
length and e; subtends the shorter are (end-points included). Let [e,;] denote 
the are on VW which e; subtends. Adjacent sides of Do subtend overlapping 
ares on W and the part of [e;] remaining when the overlapping parts are re- 
moved is denoted by [e;],. Obviously [e;], and [e;], have no common point, 
unless i = j. The transforms of the ares [e;], [e:], (¢ = 1, 2, --- , 4p) by the 
transformation which carries Dy into one of D,, De, --- , Dy, say D,, are 
denoted by [e,]“”, [e,]\” respectively. If we follow the conventions used above, 
the side e; of D, subtends the are [e;]“’ on V and the end-points of [e;]“ are the 
base-points of the side e; of D, . 

We now prove four lemmas. 

Lemma 11. The H-length of an H-line segment s whose end-points lie on two 
non-adjacent sides, or on two opposite sides, cannot fall below the constant x, where 


} 
=. oe 
(16) 5 x=2 lon {2 cos ro + (4 cos ; 1) \, 


and is actually greater than x in the latter case. 

First, suppose the end-points of s rest upon two non-adjacent sides. | s | 
cannot fall below the H-distance between the two non-adjacent sides produced. 

If we identify these two non-adjacent sides produced as 1,, lz in Lemma 8 and 
the H-circle of H-radius p/2 to which they are drawn tangent asT,, the H-distance 
between two non-adjacent sides produced is readily seen to be least when the 
two non-adjacent sides are alternating sides, inasmuch as | n | in (9) is a mono- 
tone increasing function of @in the given interval. Let x denote the H-distance 
between two alternating sides produced. We have |s| 2 x, and in order to 
obtain the value of x given in (16), we place 6 = x/2p, y = p/2 in (9), p being 
taken as given in (10). 

Second, suppose the end-points of s rest upon two opposite sides. According 
to the definition of opposite sides given above, the end-points of s lie upon two 
sides of N which do not belong to the same mesh but to two meshes bordering 
on a side, say AB of N. One of the opposite sides emanates from A and the 
other from B, the angles measured from AB to the sides being simultaneously 
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7/2p or —x/2p. The two opposite sides produced may now be interpreted 
as a pair selected from the four H-lines a, , az, b; , be in Lemma 9; one of the 
opposite sides produced will be labeled with a, the other with b. From Lemma 9 
it follows that the distance between two opposite sides produced exceeds the 
distance between two alternate sides produced. Hence when the end-points 
of s rest on two opposite sides, | s | exceeds x. 

Lemma 12. If k denotes any fixed positive integer taken from 1, 2, --- , 4p, 
exactly 4p — 3 sides of the mesh Dy have both their base-points in the interior of 
lecli. If these 4p — 3 sides are taken in their circular order around Dy, , consecu- 
tive sides are adjacent, except two, between which the three remaining sides of Dy 
intervene. 

For simplicity we shall prove this lemma for p = 2, the proof permitting an 
obvious extension to any value of p. In addition we may restrict ourselves 
with no loss in generality to the case k = 1. The meshes Dy and D, border 
along a side of N which when reckoned to Dy (D,) is labeled e; (es). If we proceed 
from this side in a counter-clockwise fashion around the periphery of D, , the 
labels of the sides of D, when reckoned to D, are met in order e3, es, @s, €6, €7, €8, 
€1, C2, @3. 

We now prove that the sides es, es, ¢7, és, €: of D, have both base-points in 
the interior of [e:], . 

Let us begin with e;. Suppose both base-points of e; do not lie in [e,], . 
Since the sides es , e; of D, when produced cannot intersect, the only way this 
can occur is for the side es of D, produced to meet the side es of Dy produced. 
On interpreting the H-line obtained by producing the side e, of D, as a trans- 
versal which cuts across the two H-lines obtained by producing the side es of 
Dy and the side e; of D, , we easily see from elementary non-Euclidean geometry 
that the latter two H-lines cannot intersect. This follows from the fact that 
the sum of the interior angles lying exterior to D, is for any p equal to (2 — 
3/2p) = (2 — 3/4) > x. 

In like manner it may be shown that the base-points of the side e; of D, lie 
in [ei], . 

The truth of the statement for the remaining sides e¢, e7, es of D, is now 
obvious. 

Let 2 denote an are on ¥ which is the sum of 4p — 4 consecutive [e ;]’s and let 
A = ¥—@Q. When Dy is transformed into a given D; (i = 1, 2, --- , 4p) by a 
transformation of G, the transform of A under this transformation will be de- 
noted by A“. 

Lemma 13. There are at least two different values of i such that [e;], > A“. 

In view of the symmetry of Dy , it is sufficient to prove the lemma when 


A = [ess], + [espe] + [ears] + [esp], , Q= )» [e,]. 


Here A C [esp-s] + [€s»-2] + [esp>—1] + [esp], so that [e,], D> A‘ whenever each of 
the sides ey)-3 , @4p-2 , Ctp-1, sp Of D; has both base-points in [e;], . 
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According to Lemma 12, 4p — 3 consecutive sides of D; have both base- 
points in [e;],. From these 4p — 3 sides 4p — 6 combinations of four consecu- 
tive sides each can be formed. In all the meshes D,, D2, --- , Dy, there are 
4p(4p — 6) such combinations, not all different to be sure, but any one com- 
bination occurring as often as any other. Since 4p different combinations of 
four consecutive sides are possible, any one combination such as @4p~3, €4p—2, 
€4p-1, C4p OCCUTS 4p(4p — 6)/4p = 4p — 6 times among the 4p(4p — 6) com- 
binations. Now a given combination of four consecutive sides occurs only 
once in a given D; , so that the combination ess , @4»—2 , C4p—1 , €4p OCCUrS in the 
desired manner in 4p — 6 different D,’s. The number of values of 7 such that 
[e,], DA” is at least 4p — 6 = 2, since p = 2. 

Lemma 14. When sinh (| n|/2) < tanh 6 and if r = r,, where r, is given in 
(7), the region T,, in Lemma 6 contains at least one entire mesh, together with the 
H-circle in which it is inscribed. 

A region of @ comprising the interior and boundary of an H-circle of H-radius 
6 drawn about an arbitrary point of ® as H-center contains at least one entire 
mesh together with the H-circle in which it is inscribed.” In view of this fact, 
and Lemma 7, this lemma is obvious. 


6. Admissible sequences. Let A, B be two points of ®, not lying on N, such 
that the H-line segment AB connecting them intersects k sides of N without 
passing through a vertex of N. The directed H-line segment AB generates 
a sequence of the e’s in the following way: beginning at A, we proceed along 
AB until we mect a side in N labeled ay, say, when the side is reckoned to the 
mesh we are just leaving; after leaving this mesh, we continue along AB until 
we meet a second side of N labeled, say, a2, when the side is reckoned to the 
mesh we are just leaving; etc. The process ends when B is reached and yields 
a certain sequence 


(17) Q,Q2 +--+ Api; 


of the e’s which is the admissible sequence of AB. The admissible sequence of 
the oppositely directed H-line segment BA is then 


, , , , 
A, Ap-1 +++ 220), 


where a;, ai-1, «++ , @2, @; designate symbols congruent to ax, ax-1, «+ , a2, a 
respectively. 

If AB is a segment of A,B, and if they meet N at the same points, their 
admissible sequences are identical. Here neither AB nor A,B, is supposed to 
meet a vertex of N or to end upon N. The admissible sequence of AB where 
A and B are points of N but not vertices of N will be defined to be the admissible 
sequence of A, B,, where A;, B, are two points not lying on N, such that A,B, 


12 Let P denote an arbitrary point of @. P necessarily lies in at least one mesh, which 
in turn is inscribed in an H-circle f of H-radius 6/2. An H-circle of H-radius 6 taken about 
P as an H-center will contain I. 
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contains AB as a segment, and meets N only at its intersection points 
with AB. 

For brevity we denote the side in N which gives rise to a; (¢ = 1, 2, --- , k) 
in (17) by a;. The segment of AB comprehended between two sides ai,, ai 
(t; < %) of N is conveniently denoted by [a;, a;,]. 

We now prove three lemmas related to admissible sequences. 

Lemma 15. If A, B are two points on N, not vertices of N, such that AB does 
not pass through a vertex of N, and if the admissible sequence of AB contains 
2pr + 1 symbols, where r is a positive integer, | AB | is bounded from below by 


2 


(18) rx = | ABI, 
where x is defined in (16). 
Let 
(19) G12 +++ AepAepir +++ Aepy41 epee +++ Aap(r4t) A2p441 +++ Aepr4t 


be the admissible sequence of AB and pick out from it the subsequences 
A2py+1 A2pr42 ++ * Aep(v+t) A2p(r41)41 (v = 0, 1, ss oa 1) 


each containing 2p + 1 symbols. The proof of the lemma is achieved by show-» 
ing that none of the H-lengths 


| [@2pr41 G2p(41)+:) | (v — 0, 1, cP 1), 


can fall below x. Let us prove this for » = 0, the method of proof being the 
same for the remaining values of ». Consider the subsequence aja - - - 2,421. 
The end-points of [a;a2,,;] lie on N but are not vertices of N. 

If at least one of the 2p segments into which [a;a2,,;] is divided by the 2p 
meshes through which it passes has its end-points on non-adjacent sides, we 
have | [a:@2)4:] | 2 x by virtue of Lemma 11. 

If, however, each of these 2p segments has its end-points on adjacent sides, 
we proceed as follows: We first note that two sides in N from which two consec- 
utive symbols in a; ds - - - @2p@2p41 arise meet at a vertex of N. Let V, V®,..-, 
Vv” denote the vertices (some of which may coincide) where the sides a; , a; 
2, @3 3 ++- 32), A2p41 Of N respectively concur. At least two of these vertices 
do not coincide. For suppose they all coincide. The 2p meshes through which 
(a; @2»41] passes ‘then possess V as a vertex and their interior angles at V” 
would exhaust a straight angle at V™. [a,q@2)4;] intersects both sides of this 
straight angle, and therefore intersects an H-line passing through V“” in two 
distinct points, thus coinciding with a segment of this H-line. This requires 
that [a,a2):] pass through a vertex of N, contrary to our assumption for AB. 
Hence all the vertices cannot coincide. Let V (1 < j < 2p) be the first 
vertex which does not coincide with V®. V™ and V™ are adjacent vertices 
and are the end-points of the side a; of N. Now consider the H-line segment 
[a;_,a;,:]. The end-points of this H-line segment lie on opposite sides” and 


‘3 For the definition of opposite sides see §5, or the proof of the second part of Lemma 11. 
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therefore from Lemma 11, we have | [a;:4;,:]| > x, from which it follows 
a fortiori that | [a;aep41] | > x. 

Lemma 16. If A, Bo, A,B, are two directed H-line segments such that Ao, Ai 
lie on the same side in N, and By, B, lie on the same side in N, and if neither of 
A, B,, A, By meets a vertex of N, their admissible sequences contain the same number 
of symbols. 

Let a (b) denote the side in N on which rest Ag, A; (Bo, By) and let a (8) de- 
note the H-distance along a (b) measured from one of the end-points [arbitrarily 
fixed] of a (b). A directed H-line segment with a-end-point on a and w-end-point 
on b determines a pair a, 8 uniquely, and vice versa. Let ao, Bo be the pair 
determined by A,B, and a, 8; the pair determined by A,B,. The directed 
H-line segment A,B, determined by the pair a , B;, where 


a, = ay + ta, — ag), B: = Bo + t(Bi — Bo) (0st 1), 


is deformed from A, B, into A,B, when ¢ ranges from 0 to 1, and its end-points 
A, , B, are displaced along a, b respectively, without meeting a vertex of N. 

Suppose A, B, intersects N in k points. None of these points is a vertex of 
N and therefore A,B, intersects k H-lines in N. Let P denote an intersection 
point of A,B, with one of these k H-lines, say with the H-line 1. During the 
deformation of A,B, from A, B, to A, B, the point P moves along both A, B, and l. 
At no time can P pass off the end-points of 1, since they rest on ¥. Likewise 
it cannot pass off the end-points of A,B,, since this would require that either 
A, or B, coincide with a vertex of N; which is contrary to hypothesis. -Con- 
ceivably the point P might be lost by A, B, and I becoming tangent to each other. 
This, however, cannot occur for it would imply that A,B, is contained in 1 
and therefore that A, , B, are vertices of N. At the conclusion of the deforma- 
tion, A, B, therefore intersects N in at least k points, none of which can coincide, 
since A, B, does not meet a vertex of N. On interchanging the réles of A, Bo and 
A, B, in the deformation, we see that A,B, and A,B, intersect N in the same 
number of points. The number of symbols in each admissible sequence is 
therefore equal to k. 

Lemma 17. The H-length of an H-line segment A,B, which terminates on the 
same sides of N as does the H-line segment AB in Lemma 15 cannot fall below rx. 

If A,B, does not pass through a vertex of N, according to Lemma 16 the 
number of symbols in its admissible sequence is the same as the number of 
symbols in the admissible sequence of AB. Hence, from Lemma 15, | A1B; | 
= rx. 

This result holds, even if A; B, passes through a vertex of N. Note that there 
is a one-to-one correspondence between the points of the square 0 S a@ & p, 
0 < B S pin the a, B-plane (where a, 8 are taken as introduced in the proof of 
Lemma 16) and the directed H-line segments’ A,B, which have an end-point 
upon each of the above sides. The points in this square which correspond to 
H-line segments passing through a vertex of N are limit points of the set of 
points in the square corresponding to H-line segments which do not pass through 
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a vertex of N. Now the H-length of A, B, is a continuous function of the argu- 
ments a, 6; , so that the H-length of A,B, in any position in which it passes 
through a vertex of N cannot fall below rx. 








7. Orthogonal families of directed H-lines. Let n’ be an arbitrarily given 
directed H-line segment. Following §2 we adopt the coérdinate system |[u,:, 
v,’] on ® associated with n’. The set of H-lines v,, = c (0 S ¢ S | n’ |) directed 
from Un,» = —& tO Uy = + is an orthogonal family of directed H-lines and is 
denoted by F,-, the orthogonal family of oppositely directed H-lines being 
denoted by F,. Fy, Px are based on n’. The are on ¥ occupied by the 
a (w)-end-points of the directed H-lines in F,, is the a-are of Fy» (w-are of F.”) 
and is denoted by a, (w,). Let a,-, b,.. denote the two directed H-lines of F,,, 
drawn through the end-points of n’, choosing the notation so that when w,, is 
traversed counter-clockwise it leads from a,’ to b,,. According to Lemma 14, 
the region T,.: 0 < uy S ty , 0 Sv S | n’ |, of &, where sinh (| n’ |/2) < 
tanh 6, 7, = 6 + log 4 sinh 4/| n’ |, contains at least one entire mesh together 



































f with the H-circle in which it is inscribed, and may therefore be transformed by 

, a transformation of G into a region of @ which contains (13). The transforms 

> of n’, Far, Gn, War, Gn, by , Tx by this transformation are indicated by 

" dropping the primes. Note that although either, or both, of a, , b, can be 

2 tangent to (13), neither intersects the interior of (13), and that since n’ lies 

. without 7, , n lies without (13). 

x The transforms of n, Fy , an, @n, Gn, 6,, 7, by the transformation carrying 

. Do into a stated D; (i = 1, 2, --- , 4p) are indicated by adding the superscript 

l (7) to n; thus 7',:) is the transform of T,,. Note that we have 

“ (20) Tx) DD; 

d We now prove three lemmas. 

e Lemma 18. w, contains at least 4p — 4 consecutive [e ;]’s in its interior. 

* For reasons of symmetry, it is sufficient to prove the lemma in the case 
when the w-end-point of a, lies in [es,], but not in [e,]; and we restrict our atten- 

be tion to this case. 

¥. From Lemma 10 it is seen that a, lies in the region Ey. + Ewa + Ey. 

re Suppose the lemma were false. The w-end-point of b, would have to lie in 

of one of the ares [e4,-4], [es»-s], --- , [a]. Let us assume that the w-end-point of 

> bn lies in [es,-4]. From Lemma 10 it follows that b, lies in the region Eins + Eip— 
+ Es + Ey-s. The H-line segment n which joins a, and b, lies outside of 

re (13) and is therefore required to intersect the side ¢4,-2 of Dp produced in two 

me distinct points. This would necessitate that n contain the side ep» of Dy as a 

of segment, which is impossible, since n does not intersect (13). In like manner, 

nt a contradiction may be reached in assuming the w-end-point of b, to lie in any 

to one of [e4,-s], [esp—e], --- , [ex]. The lemma is therefore true. 

of Lemma 19. There are at least two different values of i such that [e;], contains 


a, (i) together with the end-points of wn<i). 
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According to Lemma 18, w, contains an are © in its interior, a, and the end- 
points of w, being contained in ¥ — 2 = A. Hence a,ci) and the end-points of 
w,(:) lie in A”, which in turn, from Lemma 13, is contained in [e,], , for at least 
two different values of 7. 

Let F,; denote an orthogonal family of directed H-lines which may, or may 
not, be identical with F,-. We suppose sinh (|; |/2) < tanh 6, so that the 
region 


y+. ' ’ ‘ ry 
Zn: —Ta, & Ma, < @, 0s», S/[ni|, 


where r,; is defined by (7), contains at least one entire mesh together with the 
H-circle in which it is inscribed. We introduce 


@) neg ' “wh cs 
ni, Ff), apf), w,{*), a,{*), b,{*), T,{*) 


which are defined like n°”, Fycid, nc), @a, Ante, Bacsd, Taco, the réle of the 
region T',, above being taken now by 7',;, so that in place of (20) we have 


(21) Ti? D D; (i = 1,2,---, 4p). 


The following companion lemma to Lemma 19 is then evident if we apply 
Lemma 19 to F,;. 

Lemma 20. There are at least two different values of i such that [e;], contains 
w,{, together with the end-points of a,‘*. 


8. 6-sets of directed H-line segments. Let n’ be an arbitrarily given directed 
H-line segment and let [u,, vx] be the coérdinate system on ® used in §7 to 
define F,,. A 6-set of directed H-line segments A,-, or briefly, a 6-set A,-, is 
defined to be the totality of directed H-line segments having their a (w)-end- 
points on the segment 0 S vy, S | n’| of uy = —6 (uy = 6). A, is based 
on n’, and A, , F,” are said to be associated. 

We now prove the following lemma. 

Lemma 21. Let A, [A,/] be a 5-set based on the directed H-line segment n’ [nj], 
where sinh (| n’ |/2) < tanh 6 [sinh (| n; |/2) < tanh 6]. Let Fy [F,;] be the 
orthogonal family of directed H-lines based on n’ {n;] and associated with A,’ [A,/]. 
Let Ancir [A,§?] (¢ = 1, 2, --- , 4p) denote the 5-set associated with the orthogonal 
family Fi) [F<] of directed H-lines obtained from F,. [F,;] in §7, noting that 
Ani) [A,{] is a copy of An [Ani]. 

Among the F,::) (¢ = 1, 2, --- , 4p) there exists one, say F,.«), which contains 
a set S,««) of directed H-line segments having the following properties: 

I. Each member of S,x) intersects’ A,«) and there exists a A,\, say A,(*), 
which is intersected by all the members of S,«) . 

II. The a-end-points of the members of 8, <«) lie on the H-line segment 


(22) Unik) = —36, 0s v.60 Ss |n'|, 


WOf. §1. 
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and the H-length of each member is less than \, , where 
(23) he on A 4g eee. 
| n’| | ny | 

III. The members of Spx) intersect n“ in the points of an arc mw, which con- 
tains an are v; whose H-length is given by 


(24) [or] =| mfe™. 


Property I. Following Lemma 19, choose k from 1, 2, --- , 4p so that [e;), 
contains a,(®) and the end-points of w,««). This may be done in at least two 
ways. Having chosen k, take ki so that [e;,], contains w,{":) and the end- 
points of a,(*:). According to Lemma 20, this may be done in at least two 
ways, so that we can realize k; = k. Since [ex], , [ex,], (k # k:) have no common 
points, we see that 


(25) wn")? and the end-points of a,{*:) C interior of wax), 
(26) ak) and the end-points of wy‘) C interior of an. 


° ° k — 
Associated with n“, n{ there are the codrdinate systems [unact), vac], 


[un{*., Unit] respectively. n° (n{"?) is the segment 0 < vu < |n'| (0 < 
valt) < | ny |) of usc = O (u,{*) = 0) directed in the sense of increasing V,(*) 
(v,{*:)) and the directed H-lines in F,,«) (F,{*:)) are the coérdinate lines v,«@) = 
d (v,(*? = dy), where 0 < d S |n'| (0 S d; S | n;]), directed from u,u) = 
— 2 (Unt? = — 0) to Une) = +20 (uplt? = +0), 

Using (25), we may show that u,{*:) = 0 lies in the region 


(27) 0< un < +, 0 < mw) <|n'l, 


of @, and therefore that u,{#:) = c (c > 0) lies in (27). Conversely, using (26), 
we find that u,«) = 0 lies in the region 


(28) — 2 <ul? <0, 0 < v6? < | nj] 


of &, and therefore that u,«) = —c (¢ > 0) lies in (28). 

Consider the 6-sets A,j@, A,{*). Here A, (A,{42) is the set of directed 
H-line segments having their a- and w-end-points respectively on the segments 
of Une) = —6, Une = 6 (ult? = —5, un(*? = 6) for which O S ov, S | n’ | 
(0 < v,(*? < |nj|). In order that a directed H-line 1 intersects A,(#), it is 
sufficient that the a-end-point of / lies on a,{*: and that I intersects the H-line 
segment 


(29) Uniti) = 6, 0 < v,(4) < | nj). 


Any directed H-line in F,,:«) intersects A,«@), and since a@,®) C a,(*), those 
directed H-lines in F,,««) which intersect (29) will intersect A,{*). That di- 
rected H-lines in F,,«) exist which intersect (29) is trivial, since u,{*:) = 6 lies 
in (27), and through any given point of (27) there passes exactly one directed 
H-line of Fc). 
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A set S,««) of directed H-line segments which has property I of the lemma 
can then be extracted from directed H-lines of F,.«). 

Property I1. Let 1 denote an arbitrary member of F,,«) which intersects 
A,\*:) in addition to intersecting A,«u). Let A, B, C (Ai, Bi, C1) denote the 
points where / intersects u,w) = —6, Ue = 0, Une = 6 (ult? = —4, 
Un\*) = 0, u,(*1) = 6) respectively. Obviously, B (B,) lies between A (A:) 
and C (C,). A point moving on | from the a- to the w-end-point of | conceivably 
meets A, C, A; , C; in five possible orders, namely, 


(30) ACA,C\, AA,CC), AA,C,C, A, ACC, A, AC\C. 
If we take these cases in the order given, the directed H-line segments 
(31) x, ££, IG, At 

intersect A,««) and A,{*:). Consider the region 


(32) —36 < usw S —, 0S vx. S{n’| 


of @. The a-end-points of the directed H-line segments in (31) all lie in (32). 
This statement is trivial for the first three cases. For the remaining two cases 
we note that | 4,B| < | A,B,|. According to Lemma 6, | A,B; | < | nj | + 6 
and therefore, since 


(33) |ni| < 2%, 


inasmuch as sinh (| n; |/2) < tanh 6 < sinh 4, we have | A,B] < 38, which proves 
the statement for the remaining two cases. 

A set S,««) of directed H-line segments having property I therefore exists 
such that the a-end-point of each member lies on (22). 

Let s be an arbitrary member of such a set S,««). We shall show that | s | 
can be taken less than \; , where \, is defined in (23). Denote the a-end-point 
of s by a. 

For the first case in (30), construct the broken H-line segment aA’C;Ci, 
where A’ is a point on the side e of Dy along which Dy , D, border and C; is a 
point on the side e;, of Dy along which Dy, D,, border. Since D, is contained 
in the region [see (20)] 


T atk): 0< ue Str, S vam) Sin’, 


and D,, is contained in the region [see (21)] 


' , 
T,<tv: —Ta, & unl"? < 0, S va*? S| nj |, 


we have, on using Lemma 6, 
|aA’| < |n’| + 364+ rH, [CiCi| < | mi] +6 + ra. 
Now | A’C! | < 6, so that 


(34) Jal, | < 56+] n'| +] ni] tre +r, 
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inasmuch as | aC;| < | aA’| + | A’C}| + | CiC,|. Using (33) and the in- 
equality 
(35) |n’| < 26, 
which is proved in the same way as (33), and employing (7), we find 

7a 2 

(36) laCi| < 118 + log inh 8) 
|n’| | mi} 

Before disposing of the remaining four cases, we note that 

| AC | = 28, | AC, | < | nj| + 26 < 46. 

For the second case, we have 
(37) |aC,| S |aC| + | AiCi| < 46 + 46 = 88, 
inasmuch as | aC | = |aB| + | BC| = 46. 

In the third case, we have, since | aA | = 28, 
(38) |}aC| = |aA| + |AC| = 26 + 26 = 48. 

In the fourth case, we have, since | aA; | S 26, 

For the fifth case, we proceed as in the third case, and obtain the same in- 
equality (38). 

On comparing (36), (37), (38), (39) with (23), we see that | s| can always be 


taken less than ), . 
Property III. Consider the region 


, 
R,: — 2 < Unit) < +o, 0S »,(4? S [mm | 


of @ and denote the two remaining regions of @ by R:, Rs. From (28), we see 
that n lies in R,. Those members of F,««) which intersect (29) (and there- 
fore intersect A,‘*:) intersect n“’ in the points of an are 4, the two members 
of F,«) drawn through the end-points of 4; passing through the end-points of 
(29) into the regions R., R;. Denote these two members of F,«%) by re, rs, 
choosing the notation so that rz enters the region R,. A point displaced along 
rz (r3) from n“ to the w-end-point of rz (rs) enters, and never leaves R2 (R3). 

One readily sees from the analysis accompanying property II that when 
v,(k) = const. intersects (29), the H-length of the segment of it comprehended 
between n“ and (29) is less than A,. Hence uc) = A, intersects rz, rs at 
points interior to R:, R; respectively. Therefore the H-length of the segment 
of un%) = A, comprehended between r2 and rz exceeds | ni |, since | ni | is the 
greatest lower bound of the H-distances between points of R2 and R;. 

On placing n = mw, k = 4, x > | n;| in (2), we find 


|i | cosh Ay > | ny |, 
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and therefore that 


} ' 2) ny ry —)dy 
| Ma) > et + e™ +e™ > imie 
A set S, of directed H-line segments having properties I and II of the 
lemma, and which also possesses property III, therefore aways exists. 
Lemma 21 forms the basis for the following lemma. 


Lemma 22. Let q denote a positive integer and let Ay, An; , ~~, » +++ Ane be 
1 + q 6-sets based on the directed H-line segments n’, nj, n2, --+ , M, einen. 
where 


(40) sinh (| n’ |/2) < tanhé, — sinh (| n; |/2) < tanhé6 (j = 1,2,---,9). 


Let F ,» be the orthogonal family of directed H-lines based on n' and associated with 
Bas 

Among the copies of F, there is one, say F(a), which contains a set Sytel J 
directed H-line segments having the following properties: 

I. The members of Syiai intersect the copy Ayia) of An» and intersect copies of 
each of the 6-sets Ani, An3, +--+ , Ani. 


q@ 


II. The a-end-points of the members of S,i0) lie on the H-line segment 
(41) Unle] = —36, 0 S vntel S |v’), 
and the H-length of each member is less than X, , where 


(42) = 1léq + log ———,, (4 sinh 6) ' 


, . 
|n’| |i |* | ng |? «++ | mga |? | ng | 





III. The members of S,1«i intersect n'*' in the points of an arc u, which contains 
an are v, whose H-length is given by 
(43) lv] =| nile. 

For g = 1, this lemma is equivalent to Lemma 21. The proof for an arbi- 
trary q is now obtained by induction. 

Suppose the lemma is true for g = r — 1. On this assumption, there is a 
copy F,{r-1) of F,, which contains a set S,{--1) of directed H-line segments 
having the properties I, II, III of the lemma, g being replaced by r — 1. 

Let F,:;_, denote the orthogonal family of directed H-lines composed of those 
members of F’,{r-1) which intersect v;_, and let S,’_, denote the subset of S,tr-1) 
composed of those members of S,{--1) which intersect vit. Let A,;_, denote 
the 6-set based on v;_; and associated with F,;_,. We have 


(44) Astr-1] D A,'_,, 


and note that S,;_, possesses the properties I, II, III (¢ = r — 1) possessed by 
Sntr-11. 

Apply Lemma 21 to A,;_,, 4,;. Using the same notation as was employed 
in §7 and Lemma 21, we see that among the F,{\), (¢ = 1, 2, --- , 2p) there is 











IS 
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one, say F,‘®,, which contains a set S,“, of directed H-line segments having 
the following properties: 
I’. Each member of S,‘#, intersects A,“, and there exists a A,‘*, say A,(*) , 


r—1 *r—1 
which is intersected by all the members of S,‘”, . 
II’. The a-end-points of the members of S,, lie on the H-line segment 
(45) u<®, = —38, 0S, S|yal, 


and the H-length of each member is less than i, , where 


(4 sinh 6)” 


| »—a| | me] 





(46) AX, = 116 + log 


. k) - . r ° 
III’. The members of S,‘#, intersect v2, in the points of an arc @, which con- 
. , . . . 

tains an are 7, whose H-length is given by 


(47) p,| = |n, le. 


| 


Placing g = r — 1 in (43) and substituting the resulting expression for | »;_. | 
in (46), we find 


: 2 
So 1 hon + ig eee, 
| mp1] | m, | 
and therefore, from (42) for g = r — 1, we obtain 
(4 sinh 5)” 


[n’| [ns [? | nsf? +++ [mea l? | n|' 





Ay = 116, + log 


so that 

(48) A, = de. 
On comparing (47) and (48) with (43), we find 
(49) |5| = |v. 
Using (40), we easily: show that 

(50) Anna < Ap. 


Let >,‘®, denote the transform of S,‘_, by the transformation of G which 
carries F,’_, into F,{®,. =,®, has the following properties: 

I’. The members of 2,‘*, intersect the copy A,‘”, of A,;_, and intersect copies 
of the 6-sets A,j, Anj, --- ,Ani_,. 

II’. The a-end-points of the members of 2,{*, fill up the H-line segment (45) 
and the H-length of each member is less than A, . 

III”. The intersection points of the members of 2,{®, with v, fill up vi®,. 

According to I’, I’, the members of both S,\®, , 2,{#, intersect a copy of A,’_,. 
From (44), they therefore intersect a copy of A,-. 

In order to construct a set S,{-) of directed H-line segments possessing the 


properties I, II, III for q = r, we take n'” as that copy of n’ which contains 
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vi. as a segment. F 1-1 is the copy of F,, based on n'’. On comparing I’, 


II’, Ill’ with I”, II’’, III” in the light of (48), (49), (50), we see that a set Syir) 
exists; the are v, in III (¢ = r) being taken as the arc 9, in III’, and the members 
of S,tr1 constructed as directed H-line segments drawn perpendicular to 7, 
with their a-end-points on (45) and their H-lengths less than 4, . 

The following corollary is important in the subsequent development of the 
paper. 

Corotiary. If the H-lengths | n’|, | ni|, |n2|,---, |, | all exceed a posi- 
tive constant y, the H-length of each member of S,\«) ts less than d, , where 


Ag < a{ 113 + 2 log 4 “a ‘|. 


The proof of this corollary is immediate in view of the value given for \, in 
property II of the above lemma. 





9. The surface ¢ and the phase space M. When congruent points on # are 
taken as identical, a topologically closed, orientable surface ¢ of genus p and of 
constant curvature —1 is obtained.” 

Let ge denote an arbitrary directed H-line segment on ?. From Lemmas 15 
and 17 it follows that g» is divided into a finite number of directed segments 
by the meshes of N, each segment being directed in the same sense as go. If 
a mesh containing a segment of gs is transformed into Dy by a properly chosen 
transformation of G, the transform of the segment of g» which it contains is a 
directed H-line segment lying in D,. The totality of directed H-line segments 
lying in Dy obtained in this manner from ge constitutes a directed geodesic seg- 
ment gp on yg. ge represents g, on &. This representation is not unique, any 
copy of gs also representing g, on &. The directed H-line segments in Dy com- 
prising g, are pieces of g,. The sum of the H-lengths of the pieces of g, is the 
H-length of g, and equals the H-length of ge. 

Let P be an arbitrary point on @ and to P attach a direction @ measured in 
the usual way in the z, y-plane. Following Morse, the pair (P, 6) is an element. 
Two elements whose angles differ by an integral multiple of 27 will be taken as 
identical.”* An element (P, @) is on a directed H-line segment ge if P is a point 
of g and @ is the direction of gp at P. Two elements (P’, 6’), (P’”’, 0’) are copies 
of one another if P’ can be transformed into P” by a transformation of G which ° 
takes 6’ into 6’. The distance between two elements (P’, 6’), (P’”’, 6’’) will be 
defined to be 


| P’P’”’ | + min | 6’ — 6” + 2nz |}; 


18 See, for example, Morse II or Nielsen, loc. cit. 

16 Morse I, p.52. Our convention differs from that of Morse, since he regards two direc- 
tions as identical when they differ by an integral multiple of 7, whereas we take two direc- 
tions identical if they differ by an integral multiple of 27. 
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where min | 6’ — 6’ + 2nz| represents the minimum of | 6 — 6” + 2nz | for 
all integers n, positive, negative or zero.” A pair of elements (P%, 6’), (P¢, 0”) 
in which Po, Py are points of Dy possess a g-distance equal to the greatest lower 
bound of the distances from the copies of one to the copies of the other.” 

The set of elements (Po , @) for which Po lies in Dp constitutes the phase space 
M. The distance between two elements in M is taken to be their ¢g-distance as 
defined above. The set of elements in M afforded by the points and directions 
on a directed geodesic segment g, is a phase curve gw , and we shall say g, gener- 
ates gu. If ge represents g, on ®, we shall also say that ge generates gu. 


10. The subsets SY’, Sv, Sy of M. Let ge be a directed H-line segment on ® 
which terminates on N, intersects Dy , and meets no vertex of N. In addition 
ge is supposed to have an admissible sequence 


(51) Aq ++ OmOmi+1 oe dam 


in which a», @m4; arise from the intersections of g» with the boundary of Do. 
Consider the set of directed H-line segments such as ge which have their a (w)- 
end-points on the side a; (a2) of N. Denote this set by S°’, and consider the 
subset of S formed by those directed H-line segments which are segments of 
members of S“ and which lie on Dy. Denote this subset by S$) and let SY’ 
denote the set of elements in M which are on the members of S32. S, S$, 
SY’ are generated by the admissible sequence (51). 

For fixed m, the number of admissible sequences (51) cannot exceed (4p)*", 
so that the number of sets SY’ does not exceed (4p)"". An arbitrary element 
(Po, 0) of M is, however, contained in at least one Sy’. For take the point 
P, of Dy and construct the directed H-line AB passing through P» with the 
direction 6. If AB does not meet a vertex of N, a segment A,B, of AB can be 
taken which has an admissible sequence such as (51), and the set SY’ generated 
by this admissible sequence contains (Py), 6). When AB meets one or more 
vertices of N two cases are possible: (a) AB coincides with an H-line in N ; (b) 
AB does not coincide with an H-line in N. 

Let us consider (a). Here AB is divided into segments each of H-length p 
by the vertices of N and from any vertex V of N on AB radiate 2p — 1 H-rays 
of N into each of the regions #, , 2 into which @ is divided by AB. The 2p — 1 
H-rays radiating from V into 4, are now directed positively away from V. A 
directed H-ray 1; precedes a directed H-ray rz if the angle at V taken positively 
and less than x between r; and the negative sense on AB is less than the cor- 
responding angle for r,. Giving those directed H-rays emanating from a vertex 
V™ on AB precedence over those emanating from a vertex V” of AB if V“” 
precedes V on AB, we may arrange the directed H-rays radiating from the 
vertices of N on AB into 4, in an unending sequence 


(52) oo Ti Puse *** F97 aT % °*: factha *** 


17 Compare Morse I, p. 53. 
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Let us take #, to be the region of ® containing Dy and choose the notation in the 
above sequence so that r_;, ™ correspond to the sides of Do). Consider the 
set S‘ comprising those directed H-line segments with their a (w)-end-points 
on that side of N which lies on r_» (7m) and meets AB. The element (Po, 8) 
is on the segment of AB which belongs to S”. Consequently (Po, @) is con- 
tained in a set S\’. 

The treatment of (b) requires a slight modification of the procedure in (a). 
Here AB meets N at points of two types, those which are vertices of N and those 
which are not vertices of N. As before, & is divided by AB into two regions 
#, , d;, each of which may now contain points of. Do. In certain instances 
one of 4, , 2 may contain no points of Dy, as is the case when AB meets Do 
in a vertex only. In any event, ® shall denote one of the regions which contains 
points of D). When AB meets a vertex V of N, exactly 2p H-rays in N radiate 
from V into®,. These 2p H-rays are then directed and ordered by a repetition 
of the process employed in (a) for the 2p — 1 H-rays there considered. When 
AB meets a point P of N which is not a vertex of N, only one H-ray in N radi- 
ates from P into #,;. This H-ray is directed positively from P. A convention 
similar to that employed above in the construction of the unending sequence 
(52) then leads us to a similar sequence, and from this point onward the pro- 
cedure is analogous to that in (a). 

Take m = 2pr + 1 in (51), where 


(53) r> p/x 
and is a positive integer. From Lemma 15, we see that 
| [41am] | 2 rx > p, | [@m1@2m] | 2 rx > p, 
and therefore, since | ga! > | [a:am] | + | [amsi dem] !, we have 
(54) | ge | > 2p. 


This last inequality shows that a; , a2, cannot have a common end-point. For, 
if they do, ge lies in the H-circle of H-radius p described about the common 
end-point as an H-center, so that | g»| S 2p, which contradicts (54). In addi- 
tion, @; , d2m cannot be segments of the same H-line of N, as this would require 
ge to meet a vertex of N. 

Four distinct members of S” therefore exist which connect end-points of a, 
to end-points of d2m. Two of these, say I, l’, do not intersect. At least one, 
say l, intersects the boundary of D). For suppose this is not the case. Since 
ge intersects the boundary of Dp, the interior of the region of @ enclosed by J, I’, 
a , 2m contains Dy together with its boundary, and hence contains the H-circle 
(12) of H-radius p/2. This last is impossible, according to Lemma 4; thus / 
may be taken as intersecting the boundary of Dy . 

Let A, B (Ao, Bo) denote the a- and w-end-points respectively of 1 (ge). Let 
A’, B’ denote the points (possibly coincident) where | intersects” the boundary 


18 In ease | intersects the boundary of Do in a side of Do , A’, B’ denote the end-points 
of this side. 
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of Do, choosing the notation so that | AA’ | S | AB’ |. A review of the proof 
of Lemma 17 shows that a member A, B, of S’ not meeting a vertex of N can 
be selected so that it intersects the boundary of Ds in points A;, B; which lie 
on the same sides of Do as do the points A’, B’ respectively, and in addition is 
such that A, (B;) lies on a; (a2m) between A(B) and Ao (Bo). Using the method 
employed in the proof of Lemma 16 to deform A,B, into A,B), we easily see 
that the number of symbols in the admissible sequences of A, Aj and BiB; is 
in each case equal to m = 2pr +1. On interpreting A, B, in Lemma 17 as AA’ 
(B’B) and AB in Lemma 15 as A, Aj (Bi B,), we have 


(55) | AA’| 2rx >», | B'B| =rx > p, 
so that | AB | > 2p, and therefore 
| AB| = 2a, + 2p (Ar > 0). 


We introduce two Gaussian geodesic coérdinate systems [un, Val, [Unr, Vn’). 
In the first, A, B are the points [—(A1 + p), 0] and [Ai + p, 0] respectively, and 
the region v, = 0 of & contains the region of # bounded by I, l’, a; , a2». In the 
second, A’, B’ are the points [—k, 0], [k, 0] respectively (k is a constant, positive 
or zero) and the region v,- 2 0 of @ contains the region of @ bounded by I, I’, 
a, , Gem. 

The points R’[—rx, 0], R’[rx, 0] in the [u»’, v,] codrdinate system lie on AB. 
This follows from (55). Let Cj, Cc’, C”, Cy denote the H-semicircles drawn 
in the region v, = 0 of & about the points A, R’, R’”’, B as H-centers respectively, 
each H-semicircle having an H-radius p. Let S denote the set of directed 
H-lines which intersect both C’, C” and are directed from C’ to C’’. The 
members of S intersect u,» = 0 in the points of an H-line segment which we 
denote by n’. n’ is therefore the H-line segment 


(56) Uy = 0, . 0s vy Sin’), 


and v, = | n’ | is tangent to both C’, C’”’.. Since a; (aem) is an H-radius of C; (C1), 
one perceives, on identifying the [u, , v,] coérdinate system introduced above 
with the [u, , v,] codrdinate system in Lemma 3, that 


(57) s> 53". 


If Sz, denotes the subset of S which is formed by directed H-line segments 
which are segments of members of S and which lie in Dp , we have, from (57), 
the fact that S;, D S$”. Hence if Sw denotes the set of elements in M which 
are on the members of Sz, , we see that Sw D S\’. 

For a fixed m, the number of sets Sy does not exceed (4p)°’", each element of 
M being contained in at least one set Sy. 

Next we note that Sz, is a subset of a 6-set A, based on n’, the a @w)-end- 
points of the members of A,, composing the points of the are 0 S< vo, < | n’ | of 
Uy = —6 (ux = 6). We observe that if & denotes the H-distance from the 
center of VY to u,, = 0, we have & S 6/2 < 6. Those elements which are on 
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members of A, and belong to M constitute a set Sy of clements in M which 
contains Sy. 

For a given m, the number of sets Sy, does not exceed (4p)*", each element 
of M belonging to at least one Sy. 


11. The function n(e). Consider a 6-set A, of directed H-line segments 
based on a directed H-line segment n’ which is a segment of a directed H-line 
whose H-distance £ from the center of V is less than 6. Choose a member of A,, 
at random. Let ¢ denote an arbitrary, preassigned positive number. We pro- 
pose to calculate a function n(e) of « such that if |n’| < n(e), an arbitrary 
element on an arbitrary member of A,, lies within a distance ¢« of some element 
on the member of A,, chosen at random above. 

We begin by introducing the Gaussian geodesic coérdinate system [u,’, vn] 
on ® associated with the directed H-line segment n’. The a (w)-end-points 
of the members of A,’ lie on u,, = —6 (ux = 6). Let (P, 6) be an arbitrary 
element on an arbitrary member of A,, , and let [a, b] denote the coérdinates of 
P in the [u,, v,] codrdinate system. The member of A, chosen at random 
above intersects u,, = a at a point P’[a, b’] and has a direction @’ at this point. 
We proceed to derive an upper bound for the distance between the elements 
(P, 6), (P’, 6’). 

The H-distance | PP’ | cannot exceed the H-length of the segment of u,, = a 
lying between P and P’, and this H-length cannot exceed the H-length of the 


segment of u,,» = a comprehended between v,, = 0, v, = | n’ |, so that from 
(2) we find 
(58) | PP’ | S |n’| cosha. 


Now —6 S a S 6. Therefore 


(59) cosh a S cosh 6, .- 


and hence 
(60) | PP’ | < |n’| cosh 4, 


since the equality signs in (58) and (59) cannot hold simultaneously. 
We now take up | @ — 6’|. Let Q denote an arbitrary point of the region 


(61) —-6 3S us & 5, 0S ww S< |n'| 


of 6. Denote by ¢ (¢’) the magnitude of the angle filled up at Q by the direc- 
tions of those H-rays drawn from Q to intersect the segment 0 S v, < | n’ | of 
Un = 6 (uy = —4). If [uy , v,-] are the codrdinates of Q, the angles ¢, ¢’ are 
functions of uy, v, , and we write” 


te 


5; >= Cun: ’ Un’), e = o(uy ; Un‘). 


19 Here ¢(6, v,’), ¢’(—4, vn’) are not properly defined. We place 


€(6, vn’) = £'(—4, vn’) = 2x. 
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If we define Z = min {f, ¢’}, we have Z = Z(u, , v») yielding an upper bound 
for the magnitude of the angle filled up at Q by the directions of those members 
of A,» which pass through Q. If we hold v,. fast, ¢ (¢’) is seen to be a monotone 
increasing (decreasing) function of u,, for —é < u, S 6. Moreover, from 
reasons of symmetry, it is apparent that ((—uy , 0.) = ¢'(Un, vn), so that 
€(0, vn) = ¢'(0, v»). Hence 


Z = ¢when —6 S uy S 0, Z = ¢' when OS u,, 


IIA 


6, 
and therefore, for a given v,’ , 
Z(un , vn) S ZO, vn), —5 5 uy Sb. 


Finally, an elementary calculation which is rather tedious and need not be 
given here shows that” 


Z(0, Un’) Ss Z(0, n’ |/2), 0 s Vn’ = | n’ l, 


so that 
(62) Z(un , vw) S Z(O, | n’ |/2), —5<S uy S 4, 0S vw S jn’). 


In order to obtain an upper bound for Z(0, | n’ |/2), let us introduce a system 
of geodesic polar coérdinates [r, ¥] on &, taking the pole as the point [0, | n’ |/2] 
in the [u,’, v,’] coérdinate system and measuring y positively in the counter- 
clockwise sense from the coérdinate line Y = 0, which is taken as the H-ray 
drawn from the pole to the point with coérdinates [6, 0] in the [u,-, v,] co- 
ordinate system. According to (2), the H-length of the segment of u,, = 6 
which is comprehended between v,, = 0, v», = | n’| is | n’| cosh 6. On using 
(P) to recalculate this H-length, one finds 


| n’ | cosh 6 = / {dr? + sinh’ rdy’*}}. 


Comparing the circle r = 6 with the circle of which u,, = 6 is a segment, we 
have r 2 6 in the above integration, so that 


Z(0, |n’!/2) 
|n’| cosh 6 > / {dr* + sinh? ddy*}' > [ sinh ddy, 
0 


from which we establish that 
(63) Z(0, | n’ |/2) < | n’ | coth 6. 


On combining (62), (63), we see that the magnitude of the angle filled 
up at an arbitrary point Q of the region (61) by the directions of the members of 
A, passing through Q is less than | n’ | coth 6. 

We are now in a position to obtain an upper bound for |@ — @’!. Take the 
member of A,, passing through P (P’) which is a segment of the coérdinate line 


20 Professor Morley has kindly pointed out to me that this result is obtained very neatly 
by using the methods of inversive geometry. 
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v» = b (v, = b’), directed in the sense of increasing u,- , and denote its direc- 

tion at P (P’) by a (a’). We have 
|\@—-@|s|@—al+tjla—a'|+ la’ — 6’|. 


Now neither of | @ — a|, |a’ — 6’| can exceed | n’ | coth 4, and from (3) we have 
|a — a’| <|n’| sinh 6, inasmuch as £ < 6. Hence 


(64) |@ — 6 | <|n’ |{2 coth 6 + sinh 4}. 
From (60), (64) we have 
(65) | PP’| + |@— 6’| <|n’ |{cosh 6 + 2 coth 6 + sinh 6}, 


so that, on referring to the definition for the distance between (P, 6), (P’, 6’) 
given in §9, we see that the function n(e) defined by 


(66) n(e) = e{cosh 6 + 2 coth 6 + sinh 6}~ 
possesses the desired property. 
12. The upper bound for the ergodic function. Let « denote a positive 
number satisfying the inequalities” 
(67) e < (cosh 6 + 2 coth 6 + sinh 6)(1 — e~”), 
(68) « < 2(cosh 6 + 2 coth 6 + sinh 8) log{tanh 6 + (1 + tanh’s)'}, 


where p, 6 are defined in (10), (11) respectively, and let r denote the positive 
integer bounded by the inequalities 


(69) 1 les 2 sinh p 
Xx 








1 2 sinh p 
< - —, 
nite) Sr <t+—me ne) 


Inequality (53) follows from (67), (66) and the lower bound in (69). From (68) 
we find 


(70) sinh me < tanh 6. 
Going back to the construction of n’ in (56) and using Lemma 5, we see that 
(71) pe * <|n'| < 2e™ sinh p, 


when we place \ + p = rx in (6). On using (69), we find that (71) is replaced by 


- p | ; 
(72) Dex sinh p n(e) < | | < n(e). 


Now consider a subset Si, of M and suppose the 6-set from which it is ob- 
tained to be based on n’. The upper bound in (72) coupled with the results in 


21 One of these inequalities probably insures the holding of the other. We need not 
however, determine which inequality possesses this property. 
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$11 shows that the phase curve generated in M by an arbitrary member of A,, 
comes within a distance ¢ of every point of S},, the distance between elements 
of M being defined as in §9. 

r being chosen subject to (69), the number 2m (m = 2pr + 1) of symbols 
in (51) is consequently bounded by the inequalities 


2 sinh p 2 sinh p 
n(e) n{e) 


Let 1 + q denote the number of sets Sh occurring when m is chosen as above. 
Since 1 + q < (4p)*", we have 


S 2m < 4p +2 + Plog 








(73) 2+ 2 log 
x 


2sinhp 


4p 
l+q < (4p)”*(4p) x “°F Co : 


which reduces to 


4p log 4p 
x 








ny 4pt+2 2 sinh p 
(74) 1+4q< (4p) ( ne) 
From the upper ound in (72) and the inequality (70), it is seen that Lemma 
22 may be applied to the 1 + q é-sets introduced above. If we apply Lemma 22, 
it follows that a directed H-line segment ge of H-length less than \, exists which 
intersects copies of these 1 + q 6-sets. From the lower bound in (72) we see that 
we can set y = 3pe “n(e) esch p in the upper bound for A, as given in the corollary 
to Lemma 22. On doing this and making use of (74), we find 


4p log4p 


(75) Aw< | (tp yr (2aehe *) 7 be 1|[2 log Sad re 11. 


If the —1 in (75) is repressed” and if n(e) is replaced by its value give en in (66), 
the inequality (75) may be replaced by one of the form 





(76) Kee" [4 los - + c|, 

where A, B, C and w are given in terms of p as follows: 

(77) A = 2(4p)*”**{2 sinh p(cosh 6 + 2 coth 6 + sinh 8)} <— 
(78) B = 8p ‘ sinhé sinh pe*(cosh 6 + 2coth6é + sinh 5), ; 
(9) C= Ta, 

(80) w= Ee, 


x 
x being defined in (16). 


*2 That the first factor in the upper bound given in (75) is positive may be shown on the 
basis of the assumption (67) for e. 
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Since each element of M is contained in at least one of the sets Sy, these 
results enable us to state the following theorem concerning the phase curve gu 
generated in M by the above directed H-line segment g¢. 

TueoreM. If ¢ denotes a positive number selected in accordance with (67) and 
(68), the phase curve gu comes within a distance ¢ of every element of M. The | 
H-length of ga and therefore that of g, can be taken less than \,, where d, is subject 
to the inequality (76). The right-hand member in (76) is then an upper bound 
for the ergodic function T(e). 

Remark. The constants A, B, C, w in (76) all tend to +” for p> +, 
the constant w, in particular, being bounded by the inequalities 











(81) 4p log 4p er 4p log 4p 
log (7 + 4 V3) ~ log{(1 + 2)(2 + V2vV2+4+ DI’ 


as may readily be verified from the definitions of w and x. 
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NOTE ON THE SIMULTANEOUS ORTHOGONALITY OF HARMONIC 
POLYNOMIALS ON SEVERAL CURVES 


By J. L. WatsH anp G. M. MERRIMAN 


1. In the plane of the complex variable z = zx + iy, the polynomials 


1, z, 2°, --- are mutually orthogonal, not merely on the circumference | z| = 1, 
but also on every circumference | z| = R, in the sense that 
k=l 
/ z2\|dz|=0 k & 1. 
|z|=R 


The general problem of the existence of sets of polynomials in z which are 
simultaneously orthogonal, with respect to suitable norm functions, on each 
of several curves in the z-plane has been studied only recently. Let us say that 
the set p,(z) of polynomials in z is canonical on a rectifiable Jordan curve C 
with respect to the norm function n(z) provided the set p;,(z) is found by or- 
thogonalization on C of the set 1, z, 2’, --- with respect to the positive continuous 
norm function n(z), and provided the coefficient of z* in p,(z) is chosen positive. 
Walsh established’ the orthogonality with respect to a suitable norm function 
of certain Tchebycheff polynomials on all ellipses of a given confocal family. 
Szegé* and Walsh’ showed independently and by widely different methods the 
fact that if the same set of polynomials p;(z) is canonical on two distinct curves 
C and C’, then either C’ is a curve Cz or C is a curve Cx;* Szegd requires ana- 
lyticity of C and C’. [Let C be an arbitrary Jordan curve in the z-plane, and 
let‘the function z = ¥(w) map the exterior of C onto the exterior of the unit 
circle | w| = 1 in the w-plane so that the points at infinity in the two planes 
correspond to each other. We denote generically by Cz the image (Kreisbild) 
in the z-plane of the circle |w| = R > 1 under this transformation.] More- 
over, Szegd’ exhibited all sets of polynomials in z, each set canonical simulta- 
neously on all Cz of a given family, 1 < R < «.° The general problem of the 
existence of sets of polynomials canonical simultaneously on only two curves 


Received October 20, 1936; presented to the American Mathematical Society, Decem- 
ber 1936. 

' Bull. Am. Math. Soc., vol. 40 (1934), pp. 84-88. Also Interpolation and Approximation, 
New York, 1935, p. 134, Theorem 12. 

* Trans. Am. Math. Soc., vol. 37 (1935), pp. 196-206. 

3 Interpolation and Approximation, p. 134, Theorem 11. 

*The analogous result for harmonic polynomials follows directly by the methods of 
Walsh (loc. cit. and Trans. Am. Math. Soc., vol. 33 (1931), pp. 370-388, especially p. 385). 

5 Loe. cit. 

®° These sets are enumerated in §2, below. 
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has not been solved; the second-named writer has, however, some as yet unpub- 
lished results on this problem.’ 

The entire theory of expansions in harmonic polynomials’ in z and y is anal- 
ogous to, but by no means identical with, the theory of expansions in poly- 
nomials in z.’ It is thus in order to study the general problem of sets of harmonic 
polynomials orthogonal, with respect to suitable norm functions, simultaneously 
on several rectifiable Jordan curves; and it is the object of the present note to 
establish the more immediate results concerning this problem. 

We investigate the sets of harmonic polynomials which are obtained by 
separating into real and pure imaginary parts the sets of polynomials in z 
which are canonical, with respect to suitable norm functions, on all curves Cz 
of a given family F. Some of these sets of harmonic polynomials are orthogonal, 
with respect to the same norm functions that were used in connection with the 
corresponding polynomials in z, simultaneously on every Cp of F; the remaining 
sets are orthogonal on no Cz. 

Besides this surprising and varied deviation of the facts concerning the 
orthogonality of these harmonic polynomials from the facts concerning the 
orthogonality of their generating polynomials in z, there is a further interesting 
consequence of our discussion: a new property of the orthogonal polynomials 
in z is derived, which is some cases yields a new formula for the polynomial 
expansion of an analytic function. 


2. We list for reference the sets of polynomials in z which are known to be 
simultaneously canonical, with respect to suitable norm functions, on all curves 
Cr of the given families, 1 < R < ©; from these sets we derive the sets of 
harmonic polynomials to be studied. Where the basic curve C is not a circle, 
we enumerate also the transforms of the polynomials and norm functions under 
the mapping function z = y(w) defined in §1; we denote by. Tx the circle 
|w| = R, the image of Cg. The real and positive norm function can be ex- 
pressed in the form | D(z) |? = | A(w) |’, z = ¥(w), where D(z) [= A(w)] is 
known to be analytic and non-vanishing in the extended z- [or w-] plane out- 
side C [the unit circle | w]| = 1].”° In all cases the norm function on a curve 
Cr may be altered by multiplication by a positive constant, and the entire 
configuration may be subjected to an arbitrary linear integral transformation. 
The orthogonality condition is chosen in the form 





/ plz) p(z) | D(z) |? | dz| = [ plz) p(z)n(w)|\dw} =0 k&1lz2=y(w), 
R R 


C 


7 Presented to the American Mathematical Society, September, 1936; see Bull. Am. 
Math. Soc., vol. 42 (1936), Abstract 279. 

8 For a report on this theory, see for instance Walsh, Bull. Am. Math. Soc., vol. 35 
(1929), pp. 499-544. 

® For example, an arbitrary function continuous on a circumference C can be uniformly 
expanded on C in harmonic polynomials; the analogous proposition for polynomials in 
is false. 

0 Szegé, loc. cit., Theorem 1. 
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where n(w) = | A(w) || ¥(w) | serves as the norm function in the w- 
plane. 
The list (Szegé, loc. cit.) follows: 
I. The set Cz is the set of concentric circles |z| = R > 0; 
D(z) = 1; p(z) = 2*. 
II." The set Cz is the set of concentric circles, |z| = R > 1; 


D(z) = (1 — 2°)", ea positive integer; 
| D(z) ) = R*2°/(z* — 1)(R* — 2°); 
px(z) = 2" for 0 < k < a; px(z) = 2° “(2 — 1) fork = a. 
III.” The set Ce is the set of confocal ellipses, foci +1; 
Dz) = {2+ @ — 1)3}' @ — 17432 = yw) = wt w); 
A(w) = {3(1 — w™)}?; n(w) = 1; pe) = wt + w™. 
IV. The set Ce and z = ¥(w) as in III; D(z) = {z + (2 — 1)'} (2 — 1); 
A(w) = {3(1 — w)}*; n(w) = (w® — 1)(R* — w’)/4R*w’; 
pr(z) = (w** — w*)/(w — w). 
V. The set Cz and z = ¥(w) as in III; D(z) = (2 — 1)(z + 1)°; 
A(w) = (1 — w')*( + wy; n(w) = (w — 1)(R* — w)/2R*w; 
pez) = (w'*? — w*)/(wh — w"). 

We shall suffix the subscript H (e.g., Is) to the above numerals I-V to 
designate the harmonic polynomials derived from the sets I-V by separation 
into real and pure imaginary parts. We reiterate that in the cases III, IV, V 
the notation n(w) does not represent merely the transform of the norm function 
n(z) = | D(z) |’ in the z-plane, but represents n(z) | ¥’(w) | = | A(w) |*| ¥/(w) |, 
the norm function in the w-plane. 


3. It would perhaps seem most natural to offer real-variable proofs of our 
results concerning the harmonic polynomials In—Vxa , using trigonometric forms 
of both polynomials and norm functions. Although such proofs exist, it seems 
simpler and more fruitful to carry out the discussion by methods of complex 
variables. For this purpose we first proceed to obtain some general theorems 
concerning orthogonality of polynomials in z and of the harmonic polynomials 
which result from separating them into real and pure imaginary parts. 

Let the set of polynomials =: {p,(z)}, canonical or not, be orthogonal with 
respect to the positive continuous norm function n(z), on a given rectifiable 
Jordan curve C: 


(1) [ merrwne|ae| = 0 k#l. 


If each polynomial p,(z) is separated into real and pure imaginary parts, 

‘ - . . . . . 

Pe = Pe + tpx, We obtain a set of polynomials in x and y each of which is 
harmonic in the entire plane: 


This set was exhibited by Szegé (Math. Ann., vol. 79 (1919), pp. 323-339), but without 
mention at that time of orthogonality on more than one curve. Compare Walsh, Mémo- 
rial des sciences math., Fasc. 73, p. 43. 

" This is the Tchebycheff set proved by Walsh to be orthogonal simultaneously on 
all Cr. 
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S: 4 "ow ow 
| Po, Pi, P2,*** 


If po(z) is real and of degree zero, which necessarily occurs if the set = is canonical, 
the polynomial po’ vanishes identically and may be omitted here. An arbitrary 
set of form S is orthogonal on C with respect to the norm function n(z), pro- 
vided that the three sets of conditions 


ale 


I 
o 


[ri vincey ae k & 1, 


(2) [ mivine |dz| = 0 kL 


[ mivi’ne) |az| = 0, 
I 


are satisfied. Under the present assumption of the orthogonality of the set 2, 
the second conditions in (2) are satisfied when and only when the first condi- 
tions are satisfied; thus, for harmonic polynomials obtained by separation of 
orthogonal polynomials in z into real and pure imaginary parts, the orthogonality 
conditions (2) are equivalent to 


(3) [ pi pin(z) | dz 


= O fork & l, [ vive | dz| = 0. 


We observe that if conditions (1) and (3) are satisfied, there are also satisfied 
the conditions 


(4) / _px(2)pr(z)n(z) | dz | = 0 kL. 


Conversely, if both (1) and (4) are satisfied, and if in addition 


(5) [ viv’ ne |de| = 0 Nal 


we may infer the satisfaction of conditions (3). Conditions (5) may be re- 
placed by a condition on the pure imaginary component of an integral: 


(6) Sg if [pe(z)}’ n(z) | dz ; = 0 k=0,1,2,---. 


We may thus state 

THeorEM 1. [If the set {p,} of polynomials in z is orthogonal on a rectifiable 
curve C with respect to the positive continuous norm function n(z), then it is necessary 
and sufficient for the orthogonality on C with respect to n(z) of the set of harmonic 
polynomials (p;., p;) obtained by separating each p,(z) into real and pure imaginary 
parts that conditions (4) and (6) [or (5)] be satisfied; that is to say, it is necessary 
and sufficient that the integral 
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ois [ pu(z)pr(e)n(z) | de | 


vanish for k *= l and be real for k = l. 

It is easy to prove, by combining formally conditions (2) so as to yield 
conditions (1), 

TuEoreM 2. If the set {p,, pi} of harmonic polynomials, where p. and p, 
are conjugate harmonic functions, is orthogonal on a rectifiable curve C with respect 
to a positive continuous norm function n(z), then the set {px = pr + ip,’ } of poly- 
nomials in z is also orthogonal on C with respect to n(z). 

It is Theorem 1 to which we shall appeal in our study of the orthogonality 
properties of the harmonic polynomials In-Va. The integral J, however, is 
also of interest and importance in connection with an expansion problem. Let 
the function f(z) be analytic on and within C; the formal expansion of f(z) 
on C in terms of the polynomials {p,(z)}, now assumed canonical on C, is 


(7) fe) ~ ¥ apie), 

where 

8) a= [ sernieincer\ae| / [ pierpi@nte)| de, a. oe 
of course we have the relation 

(9) [ere n(z)|dz| ~ 0 j = 0,1, 2, 


Under the present hypothesis, the relation (7) is an actual equation, valid uni- 
formly on and within C.™ 

If now conditions (4) are also satisfied by the set {p,(z)}, we can obtain 
an alternative expression for the coefficients a; valid under certain circum- 
stances. Multiplication of equation (7) by p,(z)n(z) and integration term by 
term over C yields 


a) a= | fe) vie) me) de | / [trier me) [de| 5 = 051,20, 


provided of course that 


a) [worn |dz| <0 ju i&--. 
We collect these results as 


13 This theorem is due to Szegé in the case n(z) = 1 with C analytic, to Smirnoff in the 
case n(z) = 1 with C rectifiable and satisfying an auxiliary condition, and to Walsh in 
the general case that both n(z) and C are arbitrary. See Walsh, /nterpolation and A pprozi- 
mation, §5.2. 
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TuroreM 3. If the set of polynomials {p,(z)} is canonical on C with respect 
to a norm function n(z), and also satisfies conditions (4) and (11) (that is to say, 
if the integral J vanishes for k *= 1 but not for k = 1), then an arbitrary function 
f(z) analytic on and within C has the formal expansion (7), where a; is given both 
by (8) and by (10). The expansion (7) is valid uniformly on and within C. 


4. The general results established in §3 will now be applied to the study of 
the orthogonality with respect to the given norm functions, of the sets In—Vu 
of harmonic polynomials, on all Cz of the given families of curves. The com- 
putation will be carried out in the plane in which the family of curves is a set 
of concentric circles, that is, after the usual mapping z = y(w) in cases IITa—-Vu . 
Let Ix designate the circle |w| = R > 1; Te coincides with Cz in cases Ia 
and Ilu and is its image in cases I[In—-Vz . 


In case Ia, 
J= [ 2‘! dz| = i ght" de. 
Cr tJrr 


Hence J = 0 for all choices of k and 1 except k = 1 = 0; in the latter case, 
however, equation (5) is satisfied, since po is real. Thus conditions (4) and (6) 
of Theorem 1 are completely satisfied, and we have established the well-known 
fact that the harmonic polynomials are orthogonal simultaneously on all Ce . 
But condition (11) is not satisfied, so formulas (10) for the coefficients a; are 
not valid. 

In case IIx (the study of IIs is more complex and we postpone it temporarily), 


J = [ (wi + w*)(w' + w™) | dw| 
CR 


ad R [w* +1—1 rv oe il re oe *h-2 ww} dy 

t CR 
Hence J = 0 immediately if k + 1, so conditions (4) are satisfied. If k = l, 
we have 


where c = 4 if k = 1 = 0; otherwise c = 2. Since this value of J is real and 
non-vanishing, conditions (6) and (11) are satisfied. We conclude from Theo- 
rem 1 that the set Illu is orthogonal simultaneously on every Cz, and from 
Theorem 3 that formulas (10) for the coefficients a; in (7) are valid.” 


4 A form of orthogonality other than (1) and (4) is also of interest here, namely 
[ew Pr(z) pi(z) dz = 0 kel 
Cc 


where Q(z) is suitably chosen. See Geronimus, Trans. Amer. Math. Soc., vol. 33 (1931), 
pp. 322-328. 
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5. Among the sets (a = 1, 2, --- ) of polynomials IIs , the sets IIx , a > 2 
are orthogonal on no Cz ; the sets IIx , a = 1 or 2 are orthogonal on every Ce . 

We have the following three expressions for J for the three possible choices 
of k and I relative to a: 








: Rt k+l—a-l a ae dz 

nok ft —— ) k 2a,l 2a; 
R*™ +1 tt dz 

Aa rp R* — 24 k2za,l<a; 
naa | grtite-t dz 

sedan wl tp G@* — 1)R™ 2) k<a,l<a. 


The integrals J; and J2 both vanish, for all pertinent choices of k and l, by 
Cauchy’s integral theorem, since there are no singularities of the integrands 
interior to Tg. The integral J;, however, possesses singularities at points 
inside Tz, namely at the a-th roots of unity; we evaluate J; in the following 
manner. 


Lemma 1. Let w be a primitive a-th root of unity. The sum > w™ has the 


m=1 
value a or 0 according as q is or is not a multiple of a. 
The proof is left to the reader. : 
The value of the integral J; , aside from the constant factor, is the sum of 
the residues of the integrand at the points w, w ,---,* = 1. At the point 
w” the residue is 


k+l+a-1 
Oni gt tite se” 
Tv . 
Rte — 22 2 — 1 Jeogm 


at (o— +(k+l+a—1)(z - a] 
= 45 a(R? eae 2°) geil a(z* = 1) ze" — 








m(k+l) 
= 277 a(R — 1)" 
Hence the integral J; has the value 
Ln eT 
(12) I= Rey os” 


’ |0,if k + Lis not a multiple of a, 
| QeR***+*/(R* — 1), if k + Lisa multiple of a. 


Concerning the polynomials IIn , a = 1, we conclude immediately that the 
conditions of Theorem 1 are completely satisfied: J; = 0, J2 = 0 in all circum- 
stances; the integral J; is of significance only when k = | = 0, in which case 
the condition on J; is (5), fulfilled by virtue of the reality of po. These poly- 
nomials IIx , a = 1, are therefore orthogonal simultaneously on every Ce. On 
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the other hand, conditions (11) fail for 7 > a, so Theorem 3 is of no signifi- 
cance here. 

We turn to the set Ila , a = 2.. By (12) we have J; = O fork = 1,1 = 0; 
and J; is real fork = 1 = Oor 1. These are the only cases not covered by 
J; = Jz = 0; thus the conditions of Theorem 1 are satisfied, and the polynomials 
IIn , a = 2, are orthogonal simultaneously on every Ce. Again Theorem 3 
fails to be applicable. 

But if we examine any set Iu , a > 2, there always exist pairs of polynomials 
of the set which fail to satisfy conditions (4). For example if we choose 
k = a — 1,l = 1, we find from (12) that J; * 0, k * 1; hence conditions (4) 
of Theorem | fail to be satisfied, so the polynomials Ila, a > 2 are mutually 
orthogonal on no Cx . 


6. We turn now to the proof of the orthogonality of the set [Vu on all Ce. 
We have 





~ 4R% w—-wt w—w w* 
_ ane | (w™* re : tee 4 w* + pas + 1)(w"** pe 1)(R* oe w’) dw 
- 4R% rR ’ 


wets 





1 w'! = wt wt a ot (w° a 1)(R* a w*) dw 
CR 





We expand the numerator, and evaluate the corresponding integrals separately. 
Suppose first k 5 1; it is no loss of generality to assume, as we do,! > k. The i 
terms of J resulting from 


(w* + wh 4... + 1)w!?(R! _ w’) 


vanish because in this product each exponent of w is greater than k + 1 + 2; 
we haye 21 + 2 >k+1+ 2. The terms of J resulting from 


(w* + w** + --- + 1)(R* — w’) 


vanish because in this product each exponent of w is less than k + 1 + 2; we 
have k + 1+ 2 > 2k + 2. Hence conditions (4) are satisfied. Suppose next 
k = 1; the only terms in J which do not obviously vanish are those contributed 
by the terms R*w*'** and w**? in the numerator of the integrand. We have 


7(R* + 1) 
oR 
Hence conditions (6) of Theorem 1 are satisfied, so the set [Vu is orthogonal 
simultaneously on every Cg. Conditions (11) of Theorem 3 are also satisfied, 
so that theorem is applicable. 

Methods similar to these serve to prove that the set Va is orthogonal simulta- 
neously on every Cz , and that Theorem 3 is applicable; we omit further details. 


J= 





7. One further pertinent deduction is to be made, from Theorem 2 of §3. 
In view of our study of the orthogonality, on all Cz , of the sets of harmonic 
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polynomials In—Va , and in view of the fact that the polynomials I-V are the 
only sets of polynomials in z which are canonical simultaneously on all C, 
of a given family, Theorem 2 yields the conclusion that the sets In , [ln for 
a = 1 or 2, IIIn—-Vu are the only sets of harmonic polynomials orthogonal 
simultaneously on every Cz of a given family which can be obtained by separa- 
tion into real and pure imaginary parts of canonical sets of polynomials in 2. 

The case of the polynomials IIn , a > 2, illustrates the falsity of the converse 
of Theorem 2. 


8. We remark that the results just proved on the orthogonality of the sets 
In—Vu can also be established by the use of real variables. Various integrals 
that present themselves are analogous to Poisson’s integral, and can be evaluated 
at once by identifying the given integrals with Poisson’s integral for suitably 
chosen integrands and suitably chosen values of the parameters; in each case 
the corresponding Dirichlet problem has for its solution a fairly obvious har- 
monic polynomial. It might be supposed that the real-variable proofs are 
preferable and more natural in the study of harmonic functions, but as a matter 
of fact the methods of the complex variable as we have used them are both 
more fruitful (see for example our Theorem 3) and more illuminating as to the 
general structure of the harmonic polynomials studied (see conditions (4) and 
(6) of Theorem 1). 

For the sake of reference we list, in trigonometric form, the harmonic poly- 
nomials In—Vu , together with their respective norm functions: 

In. px(z) = R* cos k0, py, (z) = R* sin k0; n(z) = 1; 


z = R(cos 6 + isin @). 
IIn. pi(z) = R* cos k0, p, (z) = R* sin k@, for 0 S k < a, 
a@ & positive integer; 
p.(z) = R* cos ko — R*~* cos (k — a)@, 
pi (z) = R‘ sin ko — R** sin (k — a), fork = a; 
n(z) = R°*/(1 — 2R* cos ad + R**);z = R(cos @ + isin 8). 
IIx. pi(z) = (R* + R™) cos k0, p; (2) = (R* — R“) sin ko; 
n(w) = 1; w = R(cos 6 + isin 6);z = ¥(w) = (w+ w'). 
IVa. pi(z) = R°{(R*** — R*")\(R — R") cos (k + 1)8 cos 8 
+ (RX + R*")(R + R*) sin (k + 1)6 sin 6}/(1 — 2R* cos 20 + R*); 
pie (2) = RY(R*™ + R“*")(R — R™) sin (k + 196 cos 6 
— (R** — R*")(R + R™) cos (k + 1)@ sin 6}/(1 — 2R* cos 20 + R‘); 
n(w) = (1 — 2R’ cos 20 + R‘)/4R*; w = R(cos 6 + i sin 8). 
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Vu. pe(z) = R{(R** — R*)(R! — R™) cos (k + 4)0 cos 40 
+ (BR? + R*)(R + R) sin (k + 4)6sin $6} /(1 — 2R cos 6 + R’); 
pe (z) = R{(R*** + R*)(R! — R74) sin (& + 4)0 cos 40 
— (R* — R*)(R! + R-) cos (k + 4)6 sin 36}/(1 — 2R cos 6 + R’); 
n(w) = (1 — 2R cos 6 + R’)/2R’; w = R(cos 0 + i sin 6). 


Harvarp UNIVERSITY AND THE UNIVERSITY OF CINCINNATI. 














CERTAIN TERNARY CUBIC ARITHMETICAL FORMS 
By E. T. Bretu 


1. By an obvious oversight, G. B. Mathews stated (without restriction on 
the integer n) that if the integer m is representable by integers z, y, z in the form 


z+ ny’ + n°2* — 3nzyz, 
it can be represented in an infinity of ways.’ If n is the cube of a rational 
integer, and m ¥ 0, the number of representations, if any, is finite. We shall 
show how these representations may be found. In certain simple cases (§5) 


it is possible to find the exact number of representations; in all cases we give an 
upper bound (§4) for this number. 


2. We consider the representations by integers z, y, z of the integer m in the 
form 


(1) ot+ty + te? — 3¢ xyz, 


where ¢ is a constant integer ~ 0. If m = 0, we have the infinity of representa- 
tions (x, y, z) = (ht’, ht, h), where h is an arbitrary integer (and possibly further 
representations). 

Henceforth we shall take m # 0. Let m = dé, where d, dare integers. From 
(1) we have 


(2 + ty + @2z)(2° + fy? + te? — tay — Prz — Hyz) = dd; 
hence we may take 
(2) a+ty+tz =d, 
and equate the second factor to 6. Replacing tz by d — x — ty, we get 


3(2° + try + fy’ — dx — dty) +d —6 =0. 


Thus 

(3) d — 6 = 0 mod 3; d — 6 = 3h, 
where h is an integer, and 

(4) a+ (ty—dxt+ty — dy+h=0. 


Received November 19, 1936. 

1G. B. Mathews, Proceedings of the London Mathematical Society, vol. 21 (1891), 
pp. 280-7; see also Dickson’s History of the Theory of Numbers, vol. 2, 1920, p. 594. In 
applying Dirichlet’s theory of representations by norms o! algebraic integers, Mathews 
omitted to state, loc. cit., p. 281 (as he evidently intended) that the real cube root of the 
n he is considering must be irrational. 
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In order that (4) shall have an integer root x it is necessary that the dis- 
criminant be an integer square a’; whence 


(5) 3ty? — 2dty + a® + 4h — d’ = 0; 


and in order that (5) shall have an integer root y it is necessary that the dis- 
criminant be an even integer square 4b}; thus 


dt’ — 3t'(a*° + 4h — d’) = 
It follows that ¢| b;, say b: = tb. Hence, by (3), 
3a° + b* = 


and from (4), (5), (2) we find the values of x, y, z stated in the following 
THEOREM 1. AIl integer solutions (x, y, z), if any, of 


at+ty + tz — 3xyz =m 

in which t is a constant integer ~ 0 and m is any integer ¥ 0, are given by 
(6) 6c=2%d-—b+3a, 3y=d+b, Gf: =2d—b—3a, 
where a, b are integers such that 
(7) 3a” + b° = 46, 
and d, 6 are integers such that 
(8) m = dé, d= mod 3. 

Hence m, d are both positive or both negative, and 
(9) (d, 6) = (0, 0), (1, 1), (—1, 1) mod 3. 

THEOREM 2. The only m # 0 representable in the form (1) are of the forms 
9n, 3n + 1. 

3. Taking the first of (9), we have (d, 6) = (3d, , 36,)._ Hence, from the first 
of (6), 3|b, b = 3b; 

= d, — 3(b, — a), bh =a mod 2, 

the last since z is an integer. From (7) we now have a’ + 3b] = 46,. Taking 


(a, bi) = (2a2 ’ 2be), (2a; + 1, 2b; + 1), 


and dropping all suffixes, we get from Theorem 1, 
THEOREM 3. All integer solutions (x, y, z), if any, of 


ot ty + & — 30xyz = 9n, n ~ 0, 


are given by 


=d—b+a, ty = d + 2b, tz =d—b-—a, 
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with a, b, d, 6 integers such that 
(10) n = dé, a’ + 3b° = 6, 
and 
xr=d-—b+a, ty = d+ 2b+1, (z=d—b-—a-—1l, 
with a, b, d, 6 integers such that 
(11) n = dé, (2a + 1)° + 3(2b + 1) = 48. 


Similarly, the second and third of (9) give the following 
THEOREM 4. All integer solutions (x, y, z), if any, of 


at ty + t2* — 3@ryz = n, n=1 -mod 3, 
are given by 
z= (d-1)/3 -—b+a+1, ty = (d — 1)/3 + 2b, 
tz = (d — 1)/3 —b—a, 
with a, b, d, 6 integers such that 
(12) n = dé, d=6=1 mod 3, 
3(2a + 1)* + (6b — 1)? = 48. 
TueroreM 5. All integer solutions (x, y, z), if any, of 


ot+ty + t2* — 3txyz = n, n 


lll 
| 
— 


mod 3, 

are given by 

z= (d+1)/3—b+a, ty=(d+1)/34+2b, #2 =(d+1)/3-b-—a-1, 

with a, b, d, 6 integers such that 

(13) n = dé, d=-6=2 -1 mod 3, 
3(2a + 1)*° + (6b + 1)* = 46. 


4. An upper bound to the number F(n) of representations (z, y, z) of n in the 
form (1) is easily obtained from the number N(n) of representations (a, b) of n 
in the form a’ + 3b°. Let n = 2°m, where m is odd. Then’ if E(m) denotes 
the excess of the number of divisors 3h + 1 of m over the number of divisors 
3h — 1, 


N(n) = 0, if a is odd; 
N(n) = 2E(m), ifa = 0; 
N(n) = 6E(m), if a is even, > 0. 


Referring to (10), (11), we have (n # 0), 
F(9n) = 6[D) N() + Do N(48)], 
2 L. E. Dickson, Introduction to the Theory of Numbers, 1929, p. 80, Exercise 3. 
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where the sums refer to all divisors 6 > 0 of 2°m. These divisors are 2°5’, where 
0 < B S a, and 3’ is any divisor > 0 of m. A short reduction of the resulting 
inequality for F(9n) gives 

TueorEM 6. If F(k) denotes the total number of representations of k in the 
form (1), 


F(9n) S [6a + 5 + 3(—1)*] > E(8), 


where n = 2°m, a = 0, m is odd, and > refers to the divisors 6 > 0 of m. 

The factor 6 enters, since a representation (x, y, z) for a particular (a, b) 
contributes at most 6 representations by permutations of x, y, z. In the same 
way, (12), (13) give 

THEOREM 7. Jf n = +1 mod 3, 


F(n) < 12 >> E(), 
where >> refers to all divisors § > 0 of n. 


5. Taking ¢ = 1 in (1) we now consider representations in 
(14) aotyt+2° — 8zryz. 


THEOREM 8. All integers = 0 mod 9, or = +1 mod 3, and only these, are repre- 
sented in the form (14), and for integers # 0 the number of representations is finite. 

This follows from Theorem 2 and (10), (11), (12), (13), since each of these 
equations has at least one integer solution when 6 = 1. The first part of this 
theorem was proved otherwise by Carmichael,’ who showed also that for integers 
> 0 in each case there is a representation (x, y, z) with 2, y, z non-negative. 
Here, taking t = 1 in Theorems 3, 4, 5, we reach the same conclusion, with the 
additional information in 

THEOREM 9. In the form (14), 9n has the representation (n + 1, n, n — 1), 
and if n > 1, a representation (x, y, z) withx > 0,y > 0,2 > 0;n = 1 mod 3 
" + . 2 " J - ¥ ‘), and if n > 1, a representation 
(xz, y, z) with x > 0, y > 0, z > 0; n = —1 mod 3 has the representation 
(" +1 n+1 

1 

y>0,z> 0. 

Considerably more may be stated for certain special forms of n. With N, E 
as in §4, let p be a prime > 0. Then N(p) = 4 or 0 according as p = 1 mod 3 
or p = —1mod3. Hence for p = 1 mod 3 there is precisely one representation 
(a, b) = (a, 8), with a > 0, 8 > 0, of pin the form a’ + 3b. A straightforward 
application of these remarks to Theorems 3, 4, 5 with ¢ = 1 gives 

THEOREM 10. If p is a positive prime = 1 mod 3, there are precisely 18 repre- 





has the representation ( 





; : ‘ *), and if n > 2, a representation (x, y, z) with x > 0, 


’R. D. Carmichael, Bulletin of the American Mathematical Society, vol. 22 (1915), 
pp. 111-117. 
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sentations (x, y, z) of 9p in the form x* + y*° + 2 — 3:ryz, obtained by permutations 
of x,y, 2 in 


(x,y,z) = (p—1,p,p + 1), 
(l+a+8,1—28,1—a+ 8), 
(1 + a — 8,1 + 28,1 — a — 8), 


where (a, 8) ts the (necessarily) unique representation (a, b) = (a, 8) of p in the 
form a’ + 3b’ with a > 0, b > 0; if p is a positive prime = —1 mod 3, there are 
precisely 6 representations of 9p, obtained by permutations from (p — 1, p, p + 1). 

The implied distinctness of these representations is easily seen from the primal- 
ity of p and a simple contradiction. The corresponding theorems for p < 0 
follow at once if we note that 6 in preceding theorems is > 0. Similarly we find 

THEOREM 11. The only representations of the positive prime p = —1 mod 3 
in the form x° + y° + 2 — 3zxyz are the three obtained by permutations from 
(? +1 p+l1 p- *) 

eet Ae Fs 

The following immediate consequences of Theorem 7 are of some interest. 

THEOREM 12. If p is a positive prime, a an integer = 0, and if G(n) denotes 
the total number of representations of n in the form (14), 





G(p*) S 6(a@ + 1)(@ + 2), p=1 mod 3; 
G(p*) S 3[2a + 3 + (—1)%], p=-l mod 3. 


The first part of Theorem 8 (Carmichael’s result) has been generalized by 
Hua‘ to any circulant. 


CALIFORNIA INSTITUTE OF TECHNOLOGY. 


4L. K. Hua, Tohédku Mathematical Journal, vol. 39 (1934), pp. 316-321. 





ELEMENTARY PROOFS OF SOME KNOWN THEOREMS OF THE 
THEORY OF COMPLEX EUCLIDEAN SPACES 


By M. H. Stone anp J. D. TAMARKIN 


In this note we shall indicate strictly elementary proofs of certain well-known 
fundamental theorems concerning a complex Euclidean space 2.’ Such proofs 
are not difficult to construct and must be widely known, but the proofs actually 
available in the literature repose, in many cases, upon arguments drawn from 
topology or from the spectral theory. It may therefore be helpful to put on 
record proofs of a more elementary nature. 

We begin with a proposition due, in its general form, to Banach.” 

THEOREM 1. [f the sequence {gn} in & has the property that {(f, g,)} is a bounded 
sequence for each f in &, then the sequence { | gn | } ts also bounded. 

In constructing a proof by contradiction there is no loss of generality in 
assuming that (f, g.) — 0 for each f in 2. Indeed, if we make the assumption 
that { | g, | } is not bounded, we can select a subsequence {g',} of {g,} such that 
|g, | = n’, and we can then consider the sequence {g,}, where g.. = g.,/n, for 
which the properties |g, | = n, (f, g.) — 0 are evident. 

We therefore proceed to obtain a contradiction from the assumption that 
|gn| — ©, (f, gn) 0 for each f in %. Now by selecting an appropriate subse- 
quence and renumbering its elements, we may further suppose that 


lg. | 2 2", | (gm, 9n)| S 1 when m # n. 


In fact, if we have chosen g;, gz, «++ , gw, the relations | g, | ~ ©, (gm, Gn) > 0 
form = 1, 2, --- , N allow us to choose gy; from the remaining members of the 
original sequence so that 


N+l1 | 

lgvnn|. 22°", | (gm, Q9n+i)| S1 form = 1, 2,---,N. 

An obvious inductive construction therefore provides us with the desired subse- 
20 


° ° 1 ! 2) — . 1 12 
quence. The inequality | g,/|g,.|°| S 2" shows that the series Zz Qr/\ Gr | 
v=1 


i= 


is dominated (in norm) term for term by the convergent series > 2”, and 
1 


therefore is convergent in % to a limit f. By assumption (f, g.) ~ 0. On the 
other hand, we have the contradictory inequality which follows: 


Received December 30, 1936. 

1 By a complex Euclidean space we mean a complex linear vector space with definite 
Hermitian-bilinear inner product (f, g), complete in the metric defined by |f — g | = 
(f -—g,f —g)*. Hilbert space is a special case. : 

2? Banach, Théorie des Opérations Linéaires, Warsaw, 1932, p. 80. 
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l(f, 90) | = Lowa! =|14+ D @, gn)/{\g. "| 


ven 


>1- D1G@.,9)\/|o PF 21—- D4” = =. 
vein veal 3 
Consequently, our theorem is established. 

As an immediate corollary of this result, we have 

THEOREM 2. If {(f, g,)} converges for each f in &, then there exists a uniquely 
determined element g in ¥ such that (f, gn) — (f, g). This element g belongs to the 
closed linear manifold M generated by the sequence {g,} and satisfies the inequality 
|g | & G, where G = lim inf |g, | < + &. 

Theorem 1 shows at once that G is finite. Now the limit L(f) of (f, g,) obvi- 
ously depends linearly on f and, by virtue of the Schwarz inequality | (f, g,) | S 
\f| | gn |, satisfies the inequality | L(f)| < G|f|. A theorem of Riesz’ there- 
fore asserts the existence of a unique element g in % such that L(f) = (f, g) for 
every f in &, and such, furthermore, that |g| < G. Since f is in the orthogonal 
complement P* of M if and only if (f, g.) = 0 for every n, we see that every 
such f satisfies (f,g) = L(f) = 0. Thus g belongs to the orthogonal complement 
of M*—that is, to M** = M. 

It should be noted that the inequality |g| < G can hold; for example, if 
{gn} is an infinite orthonormal set, we have g = 0,G = 1. 

We now establish a well-known selection principle. 

TueroreM 3. In order that {g,} contain a subsequence {g.,} such that {(f, g..)} 


converges for each f in &, it is necessary and sufficient that G = lim inf |g, | < + ~. 


The necessity of the stated condition follows at once from Theorem 1. The 
sufficiency is proved by a quite familiar argument which we repeat only in out- 
line. By restricting attention to a suitable subsequence and renumbering it, 
we may suppose that |g, | < K, where K is a fixed but arbitrary constant ex- 
ceeding G. The inequality | (gm, gn)! S |gm!{gn| < K° then permits us to apply 
the diagonal process so as to obtain a subsequence {g,} for which the sequences 
(gms Qn)}, m = 1, 2,3, --- are all convergent. This is the desired sequence. 
Dropping primes, we have |g, | S K and {(g», gn)} convergent for every m. 
If f is an arbitrary element in %, we can write f = f; + fe , where f; is in the closed 
linear manifold IM generated by {g,} and fz is orthogonal to M@. Thus, if g is an 
arbitrary element of &, we have 


(FS, 9m) — (F, 9x) | = | fis gm) — (fis Gn) | 
= | (fi — 9, 9m) | + | (fi — 9,90) | +1. gm) — G, Gn) | 
S 2K \fi—g9| + |G, 9m) — gn) |- 
Here we can choose g as a linear combination of g; , gz, --- so that the first 


°F. Riesz, Zur Theorie des Hilbertschen Raumes, Szeged Acta, vol. 7 (1934), pp. 34-38. 
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term is rendered small; and the second term then becomes small whenever m 
and n are both sufficiently great. Hence {(f, g,)} converges for arbitrary f in &. 

We can restate the results of Theorems 1-3 in other terms by introducing the 
following definitions: a sequence {g,} is said to converge weakly or to be weakly 
convergent if {(f,g,)} converges for each f in 2; to have a weak limit g if (f, gn) — 
(f, g) for each f in &; to be weakly bounded if {(f, g,)} is bounded for each f in ¥&; 
and to be bounded if { |g, |} is bounded. In fact, it is easily seen that our 
theorems imply the following propositions: 


(1) a sequence is weakly bounded (if and) only if it is bounded; 

(2) 2 is weakly complete—that is, every weakly convergent sequence has a 
weak limit in %, necessarily unique; 

(3) every closed linear manifold 2 is weakly closed and weakly complete 
that is, a weakly convergent sequence in Jt has a weak limit in MN; 

(4) the “sphere” K(R) specified by the inequality |g | < R is weakly closed 
and weakly compact—that is, every sequence in K(R) contains a weakly 
convergent subsequence with weak limit in K(R). 





On the other hand, Theorems 1-3 can be deduced from these four propositions 
in an obvious way. 

We turn now to a problem in operator-theory. We prove* 

TuHEeoreM 4. If the operator A has 2 as its domain, then the following properties 
are equivalent: 

(1) A is a closed linear operator; 

(2) the domain D(A*) of the adjoint A* of A is everywhere dense in &; 

(3) A is bounded and linear. 

When A has any of these three properties, D(A*) coincides with % and A* has the 
same bound as A. 

We establish the implications (1) — (2), (2) — (3), (3) — (1), proving inci- 
dentally that (2) implies D(A*) = &. 

Assuming (1), we form the graph @(A) of A in the direct sum’ 2 @ 2 and apply 
reasoning due to von Neumann.’ (A) is the set of all elements of the form 
\f, Af}, where f belongs to &%, the domain of A. Since A is closed and linear, 
(3(A) is a closed linear manifold. In fact, the identity 


[fn — SP + | Af — SP = | (fa, fed — (604 


shows that {f, , Afn} — {f,f*} if and only if f, — f, Af, -f*; and the assumption 
that A is closed permits us to conclude the relation f* = Af or the equivalent 





4 J. v. Neumann, Uber adjungierte Funktionaloperatoren, Annals of Mathematics, vol. 33 
(1932), pp. 294-310, especially Satz 12, p. 310. See also reference to Tamarkin in footnote 
on p. iv of the foreword to Stone, Linear Transformations in Hilbert Space, New York, 1932. 

5 For definitions of terms used, see Stone, loc. cit. 

6 Stone, loc. cit., p. 30. 

7 J. v. Neumann, loc. cit. 
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relation {f, f*} «@(A). Similarly, the assumption that A is linear leads to the 
relations 


p> ax{ fi, Afx} = {> afr , > Af) = ‘= afr A(X ahs) @(A). 


The orthogonal complement of @(A) is a closed linear manifold @*(A), and since 


(A) is a closed linear manifold, it coincides with the orthogonal complement 
of @*(A). Now the identity 


(Af, 9) — (f, 9") = (tf, Af}, {-9*, 9}) 


shows that {—g*, g} « G*(A) if and only if (Af, g) = (J, g*) for every f in ¥ 
in other words, if and only if g « D(A*) and g* = A*g. Since D(A*) is obviously 
a linear manifold, we can prove that D(A*) is everywhere dense in ¥ by showing 
that the only element h orthogonal to D(A*) is the element h = 0. If h is ortho- 
gonal to D(A*), then 


({0, hk}, {-—g*, g}) = (A, g) = 0 


for all elements {—g*, g} in G*(A). Hence {0, h} is an element of @(A), the 
orthogonal complement of @*(A), and the relation h = AO = 0 is valid. We 
have thus deduced (2) from (1). 

Now let us assume (2). If g is an arbitrary element of %, we choose a sequence 
{gn} in D(A*) such that g, — g. Since the relations (f, A*g,.) = (Af, gn) — 
(Af, g) hold for each f in 2, an application of Theorem 2 establishes the existence 
of a unique element g* such that (Af, g) = (f, g*) for every f in %. Hence g « 
D(A*) and A*g = g*. Our first consequence of (2) is therefore the identity 
D(A*) = & Since A and A* are both defined over &, it is evident that the 
adjoint A** of A* coincides with A and that A* shares property (2) with A. 
The identity A = A** shows that A is linear. If it were not bounded, there 
would exist a sequence {g,} with the properties 


lim inf |g,| < +, lim | Agn| = +; 


indeed, the first of these properties could be replaced by the stronger property 
|gn| = 1. Now the equation (f, Ag,) = (A*f, gn) holds for all f in &. Hence 
we could apply the sufficient condition of Theorem 3 to the right-hand member, 
the necessary condition of the same theorem to the left-hand member, so as to 
infer the relation lim inf | Ag,| < + «. We would thus obtain a contra- 


diction. Hence A must be bounded. Since our argument can be applied also 
to A*, the latter operator is also bounded. If a is the bound of A—namely, 
the least non-negative real number such that | Af| < a|f| for every f in ¢— 
and if a* is the bound of A*, then the inequalities 


|Af |’ = (Af, Af) = (A*Af, f) S |A*AS||f| S a* |AS||F |, |Af| Sat |S) 


show that a < a*. By symmetry we have also a* S a and hence a = a*. Thus 
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we have proved not only that (3) is a consequence of (2), but also that, if A 
has property (2), A* is defined over ¥ and is a bounded linear operator with the 
same bound as A. 

Finally, the implication (3) — (1) is trivial, for if A is a bounded linear operator 
with & as its domain and a as its bound, then the relation f,, — f implies 


| Af. — Af| =| Ai -f)|salf. —f\— 0, Af, — Af, 


and A is therefore closed. 

A very useful application of this theorem is the following result due to Hel- 
linger-Toeplitz.° 

TueoreM 5. The following properties of an infinite matrix {Amn}, m,n = 
1, 2, 3, --+ , of complex elements are equivalent: 





(1) {ann} 7s bounded—that is, there exists a non-negative constant a such that 


N |) N 2 N 
| a2,| Sa* Do |z,/? 
u=1 | v= v=] 
for arbitrary complex numbers x, , +++ , 2» and arbitrary integral N; 
oo 


(2) the convergence of : |x, |" implies the convergence of 
v=] 


oc | 2 


> id a,,2,| ; 


p=l | v=1 


(3) the convergence of > | 2,|? and > | y, |* implies the convergence of the 
v=l p=l 


double series 


- eo 
} Ay» 2, Gu ; 


uw 
oo oo 
(4) the convergence of 7 | a, |" and > | y, |* implies the convergence of the 
v=] 


oe x _— 
series ye (= tyes) Yu - 


p= p=] 


These properties are equivalent to the corresponding properties for the adjoint 


. * * , ° . 
matrix {Amn}, Where Ann = Anm- When any of these equivalent properties holds, 
we have 


yy Ay yGy = 3 (> a2) i. = bs ry (= woot): 
— 


be w=1 \v=1 v=] 
We shall establish the implications 
az (2) 


N\ 
(1) > (3) 3) 
* (4) (2) 


§ Hellinger-Toeplitz, Mathematische Annalen, vol. 69 (1910), pp. 289-330, esp. pp.321-322. 
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and shall show further that (1) implies the final statement of the theorem. 
Since we have 


= On, YrLy = ya Any 2rYu » 
Bw aml 
we see that property (3) holds for a matrix if and only if it holds for the adjoint 
matrix. Thus the equivalent properties (1)-(4) are equivalent to the cor- 
responding properties for the adjoint matrix. 

We shall obtain proofs by applying Theorems 1, 2 and 4 in the concrete 


Hilbert space po of all sequences x = {2,, 22, 23, *** } for which > | z, |? 


v=1 
converges. In §o, the elements £&, = {din , den, dsn, *** $, Omn = Ofor m # n 
and dn, = 1, where n = 1, 2, 3, --- , constitute a complete orthonormal set. 
Hence the linear manifold 9 which they generate is everywhere dense in bp . 
We denote by Ay the operator which carries z = {2,, 22,23, +++ } into Ayr = 
N N 
‘2 Gy2,,°**, 2) yt, 0,0, «+> \ It is evident that Ay is a linear operator 
v=l1 


v= 
with §o as its domain, and it is easily shown that Ay is bounded and has an 
adjoint Ax. 
Assuming (1), we first observe that, by virtue of the inequalities 


a 2 N Ea 
Ayr = | Daz, so QDVii2f sad |z,[ =a*\2/ 
v=1 


w=l | v=1 \ v=1 





the operators Ay , N = 1, 2,3, --- , are uniformly bounded. Since (Awé,, Em) 
= Ann for N = mand N 2 n, the sequence {(Awz, y)} converges for all x, y 
in M. Now if z, 2’, y, y’ are arbitrary elements of So, we have 


| (Aux, y) — (Awa, y)| S | (Au(x — 2’), y) | + | (Av (@ — 2’), y) | 
+ |(Auz’,y — y')\ + | (Awa, y — y') | + | (Aue’, y’) — Ave’, y’) 
S 2a\x—2'|\y| + 2a|2'||y — y’| +| (Awz’, y’) — (Avz’,y’) |. 


We can render the first two terms in the final expression small by choosing 
zx’ and y’ in M so as to approximate z and y respectively; and we see that the third 
then becomes small for all sufficiently great M and N. The sequence {(Awz, y)} 
therefore converges for all z, y in S». Since Ay has adjoint Ay with Gp as its 
domain, the equation (Ayz, y) = (x, Axy) shows that the sequence { (x, Ax y)} is 
likewise convergent. According to Theorem 2, there exist unique elements 
Az, A*y, such that 





(Ayx, y) — (Az, y) = (2, A*y) 


for all z, yin $9. The operators A and A* so defined have bp as their common 
domain, are both linear, and are adjoints of one another. According to Theorem 
4, we see that in addition A and A* are both bounded. We note in particular 
the relations 
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(AEn, Em) = Amn, (A*En, Em) = (Er, AEm) = Gam = Onn. 


x oo 
Writing z = Dm (x, &)& = > x,t, for the arbitrary element x = {21, 22,23, -°-+ } 
v=1 v=1 





and applying the linearity and continuity properties of A, we now have 


|Ar|? = > | (Az, &) ? = 0 | (a(x n&), ‘) 


“= p= v=1 

o |; 2 Oo | 2 12 
= D | dX (Ab, , &) 2 == Do | De Gyr ty| 

p=1 | v-=1 | u=1 | »=1 | 


Thus (1) implies (2). Also, putting y = yi yj, we find similarly that 
pel 


(Aya, y) = (= (= tet.) be, ») = Xu Ay, X,(E, , y) 


w= v=1 


N 
-” 2 Oyy Xs Ypy bites (Ag, y). 
1 


v= 


Thus (1) implies (3). By analogous reasoning we have 


(Ag, y) = > (Az, &,)(E, ’ y) = > (= tt») Yu 


w= v=1 


(Az, y) = (x, A*y) = p> (x, &)(é, A*y) = p> 2,(A*y, &) 


= >» wT, > doen) 
v=1 =1 
Thus (1) implies (4); and also implies the equality of the sums of the three series 
expressing (Az, y). If we denote by Ey the projection of So on the (closed) 
linear manifold generated by &, --- , vy, we see that Ay = EyAEy. It is 
thus easy to show that Awx converges in $» to Az, and that the bound of A 
is the least constant a for which the inequality of (1) holds. 

Next let us assume (2). We can then define an operator A which carries 
x = {%, %2, %3,°:: } into Ar = «>> ayaz,, >, ayt,,--: >. It is evident 

v=1 v=1 


that A is a linear operator with So as its domain. Since 


(x, Axém) = (Aw2, &m) = > Amy Ly —> Dy Amy, = (Ax, Em) 


v=1 


when N becomes infinite through values exceeding m, we can apply Theorem 2 
to write (Az, &,) = (2, *) for m = 1, 2,3,---. Hence A has adjoint A* de- 
fined throughout the linear manifold Mt. By Theorem 4, the operator A is 
bounded. If ais the bound of A, we have 
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co) ] o 


Y| Sox) = [Ari sa'lzf =a Dla! 
p=1 | -=1 | v=] 

Applying this inequality to the case where x = {2,, %,--:,2y,0,0,--> }, 

we have 


N 


Flt os! oh 


p=l | vm | p= 








N 
Yawz,| <a) |z,/ 


v=1 v= 





Hence (2) implies (1). We may remark that the operator A defined in the 
present paragraph is now easily identified with the operator A introduced in the 
preceding one, by virtue of the relation (Awz, &,) — (Az, &,) which holds for 
both operators A. 

Next let us assume (3). If x = {2,,22,273,--: }andy = {y,ye,ys,°°* } 
are arbitrary elements of Sp , we see that 


(Awz,y) = z Oy, t,Y, = (x, Avy) 
converges. By Theorem 2 there exist unique elements Az, A*y such that 
(Az, y) = p> Ayr try, = (x, A*y), 
for all z, yin So. The operators A and A* thus defined have §» as their com- 
mon domain, are both linear, and are adjoints of one another. According to 


Theorem 4, they are both bounded, with common bound a. Consequently 
we have 


. —|* 2 2 2 2 - 2 — 2 
LY zy) Sa lzPlyP=o Dlzl L ly. 
av=l | v=1 p=1 


Applying this inequality to the case where x = {m,-++-, ty, 0, 0,--- } and 
= Aya, we have 








| N |2\2 N 
|| a2. | <a’ 2D |2,/ rf > ay ’ 
= om i oun = 
and hence 
N N N 
b> Oy, my = <a } | wy |? 
p=1 | r= v=1 








Thus (3) implies (1). Again the operator A introduced here is easily identified 
with those previously defined. 

Finally, we assume (4). Then for arbitrary x = {x, 22, 23, --* } in Go the 
series > Amy2, converges to a sum z,, ; and for arbitrary y = {yi, Y2, Ys, °°* } 


in §o, the series pm Zuy, converges. Let ¢y be the element {z, 2, --: 
p= ° 


’ Zn) 
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i 
0, 0,--- } in So. Then (fy, y) = » Z.y, converges for arbitrary y in ©». 
u=l 
Hence Theorem 1 shows that 
. 


N oo 
lew? = Deel = 2 LD Gyr, 
= 


p=1 | v=] 


2 


remains bounded. We see therefore that (4) implies (2). 
With this the proof of our theorem is complete. 


HarRVARD UNIVERSITY AND BROWN UNIVERSITY. 




















THE STRUCTURE OF CERTAIN RATIONAL INFINITE ALGEBRAS 
By Orro F. G. ScuILiine 


G. Kéthe has investigated infinite algebras over abstract fields in his paper 
“Ueber Schiefkérper unendlichen Ranges”.’ In this note we shall apply some 
of his fundamental results to infinite algebras over a finite algebraic number 
field. Application of the theory of finite algebras over algebraic number fields 
enables us to give explicit representations of the algebras considered as general- 
ized crossed products.” Furthermore, we shall investigate the arithmetic in 


such infinite algebras. 


1. Algebraic theory. Let k be an algebraic number field of finite degree 
over the field of all rational numbers. We consider algebras of infinite rank over 
k as centrum. We assume that the algebras A are countable, i.e., that there 
exists a countable set {a,, a2, --- , ai, **+} of elements of A, such that each ele- 


ment a of A can be represented as a finite sum > k;, a:, with coefficients k;, 


v=l 
in the field k. Furthermore, we restrict ourselves to completely normal alge- 
bras which we define as follows. 

DeriniTion. A countable infinite algebra A with centrum k is called totally 
normal over k if every finite system {b,, --- ,}b,} of elements of A lies in a 
normal simple finite algebra over k. 

With slight. modifications of Kéthe’s proofs one proves 

THEOREM 1. For every totally normal algebra A there exists at least one defining 
sequence k © --- © Ais & Ai ] --: of normal simple algebras A;. 

Conversely, we have 

THEOREM 2. Every sequence kk © --- & Ain & Ai &-:: of normal simple 
algebras A; over k defines an infinite totally normal algebra A with the center k. 

THEOREM 3. Every totally normal algebra A over k is representable as the direct 
product of a countable infinity of normal simple algebras A; of finite degrees over k. 
Such a decomposition is not necessarily uniquely determined. 

If we collect all simple normal systems of such a decomposition which belong 
to a fixed prime g, we have 

THEeoreM 4. Every totally normal algebra A over k is the direct product of 
a countable infinity of simple algebras A‘ which are primary with respect to k. 
The factors are uniquely determined except for isomorphisms. 


Received December 23, 1936. 

1G. Kéthe, Math. Annalen, vol. 105 (1931), pp. 15-39. See this paper also for the differ- 
ent notions used later on. 

? For the theory of crossed products see the report of Max Deuring, Algebren, Berlin, 
1935. 
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Thus the determination of structure is reduced for this type of algebra to 
that of the structure of finite or infinite totally normal primary algebras A“. 

Now let us consider a direct product A = A; X --+ X A; X --> of infinitely 
many finite primary algebras A;. Then As =A, X-:-X A, = D, X Re. 
with a normal division algebra D, and total matric algebra k,, of degree r, 
over the field k. K6éthe has shown 

Turorem 5. If the degrees r,, of the matric algebras successively contained in A* 
tend to infinity with increasing n, the infinite direct product A is isomorphic to the 
direct product M of infinitely many matric algebras k,. 

With the help of Theorem 3 and the theory of finite normal simple algebras 
over an algebraic number field k, we prove 

THEOREM 6. There do not actually exist infinite totally normal primary division 
algebras D over a finite number field k. 

Proof. It is a well-known fact that the direct product of two normal finite 
division algebras D, and D, over k is a division algebra if and only if the ex- 
ponent of D,; X Dz is equal to the degree of D,; X D.. The degree of D,; X De 
is equal to the product of the degrees of the factors. Now let D, and De be 
primary division algebras with the respective degrees q“' and q%, a1, a2 both > 0. 
Then the algebra D, X Dz has the degree q@'*™. 

On the other hand, the index of an algebra is determined as the least common 
multiple of the local exponents or indices.’ According to the theory of finite 
normal algebras over an algebraic number field, the local invariants of a product 
are equal to the sum of the respective local invariants of the factors. The latter 
can be represented for every prime ideal p of k as fractions with the maximal 
denominators q™*““' *. The possible exponent in the large is then at most 
gq’) * But this means that the direct product of two primary finite 
division algebras D, and Dz, must necessarily split off a matric algebra, because 
max(a;, a2) <a,+ a2. We apply this to the products D, X Dz, X --» X D, = 
Di‘, X k,,, where D%, is the division algebra which belongs to the product. Then 
one sees immediately that the degrees r, of the matric algebras tend to infinity 
with increasing n. According to Theorem 5 all these products are isomorphic 
to the infinite product of matric algebras k,. Every division algebra D can be 
represented as an infinite direct product of finite division algebras as stated in 
Theorem 3. But our investigations of direct products of primary division 
algebras show that they must be isomorphic to M. 

There remains then the study of totally normal algebras which are not iso- 
morphie to an M. They must be representable as infinite direct products of 
primary finite normal simple algebras, as Theorems 3 and 6 show. 

We apply the theory of structure as developed by H. Hasse’ to this problem. 
He shows that every finite normal algebra A over the algebraic number field k 


° Cf. footnote 2. 

‘Cf. footnote 2. 

*H. Hasse, Die Struktur der R. Brauerschen Algebrenklassengruppe wiber einem algebrai- 
schen Zahlkérper, Math. Annalen, vol. 107 (1933), pp. 731-760. 
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of degree n can be represented as a cyclic crossed product (a, Z/k, u), where Z is 
cyclic over k of degree n and a is an element (+ 0) of k, u an operator, describing 
the algebra A with respect to the splitting field Z. The complete factor set 
of this representation is given by the square matrix of n rows and columns 


D eeeceeus 1 

. “a 

F(a) a4 : roy 
onc ass a 


Therefore we may write A = (F‘"’(a), Z/k, u). 


Now we consider two prime’ cyclic products A; = (F‘""’ (a), Z,/k, uw) and 
A, = (F“ (ag), Z2/k, ue). Their direct product is isomorphic to a cyclic algebra 
with the splitting field Z7,Z.. This field Z,Z2 is cyclic over k, because we as- 
sumed A; prime to Az. The Galois group of Z,Z2/k is isomorphic to the direct 
product of the Galois groups {S,} and {S:} of Z;/k and Z./k. For this reason 
we shall represent A; X Az by the symbol 


(F“"” (a) @ F“"*) (ag), Z:Z2/k, {th, us}), 


where F“"? (a;) @ F'"? (az) is an abbreviation for the factor set arising from the 
(nyn2)” products 


uy -us? = af'as*-uit'-ud’, 

‘ : (ny) x(ng) 

if m, = gin, + yw and m2 = gone + wo. This factor set F’"" (a,) @ F‘"® (az) has 
the form 


F(a) — F(a) 
; ae: F"" (ay) 


F(a) ag: F” (ay) = de: F (ay) 


with m2 rows and columns. It is the multiplication table belonging to the nn2 
elements of the complex {u;, we} in the ordering 


2 a | 2 —1 2 2 
l,w,uj,***,u, Us, Uz, ***, Us” , Uitle, Uil2, *** , Ue, 
ry bd 7 ) 4 
The matrix F‘"®(a.) @ F‘"” (a;) is generated as the tableau of 


noq—1 ni—1 


2 2 2 2 
1, Ue, U2, °***, Ue” , U1, Uy, °°", Uy Us, Us, *** , Ue, 


- 


Obviously F? (az) @ F‘"?(a;) can be obtained from F'"? (a) @ F‘"? (a) by 
interchange of rows and columns, and vice versa. 

Furthermore, it is required that U2 Ze = 2% and u; Zu, = 22 for arbitrary 
elements 2;, 22 (~ 0) in Z,; and Z,. We shall not reduce this representation of 
A; X Az to the cyclic form (F“ "? (ay), Z:Z2/k, uy). 

If we form the union Z of infinitely many finite algebraic number fields Z; 


® Two cyclic products are said to be prime if their degrees are prime. 
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which are prime over k and cyclic, this field Z is an infinite cyclic extension of k. 
The direct product z of all the cyclic Galois groups of the fields Z;/k is a group 
of automorphisms of the field Z/k. It is contained in the most general group of 
automorphisms of Z/k as introduced by J. Herbrand’ and W. Krull’ by topo- 
logical methods. All elements of z are of finite order by definition of the direct 
product. 


Now let us consider infinitely many crossed products A; = (F‘"" (a,), Z;/k, us), 
which are prime with respect to k. Then the infinite complex {w, ue, --- } 
with the relations 

m mi “ oi @g 
ui, 266 % us;! — us. oe eens ui, tj a; % e eee -azi! 


with m;, = gi;ni; + mi; for every finite product of operators forms a crossed 
representation of the group z. The factor set arising above may be called 
F(a) @ F"?’ (az) @ «++. We define then the infinite algebra 


A = (F(a) @ F'"? (a2) @ «++ , Z/k, {ur, ue, ++ }) 


Pry Pre 


as the set of all finite sums >> Kees +++ 0911 +*+ 44 Se, ° °° Be, Ue, °°* Ue,’ With ele- 
o,f 


ments k... in the field k. 

The laws of composition are completely defined by the relations 

(i) TI wri = (fof! - TT wis 

v=l v= v=l1 
(i Uz 2 jus = 2; (i # j) for every z; ¥ 0 of Z;; 
ii 

, uj zu; = 2; for every z; ¥ 0 of Z;. 

We shall call such an infinite crossed product with cyclic Z/k a generalized cyclic 
product. Then we have 

THEeorEM 7. The generalized cyclic products A are totally normal over the 
field k. 

Proof. It is obvious that the finite direct products A; X Az X -:: X Ai 
form a defining totally normal sequence of A. Theorem 2 asserts that A is then 
totally normal. 

THEOREM 8. Every proper’ infinite totally normal algebra A over k can be 
represented as a generalized cyclic product. 

Proof. Let A be decomposed according to Theorem 4. The components are 
finite simple normal algebras A; which are prime with respect tok. Theorem 7 
leads to possible representations. We remark that these representations are 
not at all uniquely determined. 


7J. Herbrand, Extensions algébriques de degré infini, Math. Annalen, vol. 108 (1933), 
pp. 699-717. 

8W. Krull, Galoissche Theorie der unendlichen algebraischen Erweiterungen, Math. 
Annalen, vol. 100 (1928), pp. 687-698. 
® That is to say an algebra which is not isomorphic to an infinite matric algebra M. 
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2. Arithmetic theory. We first introduce some notions already known in 
the theory of infinite algebraic number fields." Let k = Ap © --- © Avi © 
A; © --- be a defining sequence of normal simple systems of the totally normal 
infinite algebra A over k. 

Derinition. We say that the series Ro C--- C R:, CR; C-::: of 
rings R; in A; is a sequence “belonging to A = {A,}” if and only if 

(i) the rings R; are of maximal rank in A,;, 

(ii) Ava NR; = Ryo. 

We consider also ideals a; in the rings R;: left, right, and two-sided ideals. 

DEFINITION. A series @ C--: © ais € a; ©--- of ideals is called a 
sequence of ideals belonging to the sequence of rings R; if and only if 

(i) a, OR, aR, C aj, 

(i) Ry Nn a; = aj-1. 

Condition (i) is to be changed according to the nature of the ideals. 

Then we have 

THEOREM 8. Between the rings R of A and the sequences of rings {R;} can be 
established a one-to-one correspondence, which is explicitly given by 


iR;} —~R = rR, R— {RN A; =R;}. 


THEOREM 9. Between the ideals a of the rings R in A and the sequences of 
ideals {a;} belonging to the sequence of rings R = {R,} there holds a one-to-one 
correspondence according to the formulas 


la:} >a = Daj, a— jaf) R; = aj}. 


For the proofs one has only to recall the different definitions. 

DeriniTion. A ring J of A is called a maximal order if 

(i) J contains the maximal order of k, 

(ii) all elements of J satisfy minimal equations with coefficients in the 
maximal order of k, 

(iii) J is not contained in a larger ring, which fulfills the conditions (i) 
and (ii). 

We now construct special maximal orders of A. Let A = A; XK Ao X-::: 
xX A; X --- be a representation of A according to Theorem 3. In each system 
A; we fix an arbitrary maximal order M;. Then we form successively the 
direct products 


M; X Me © Mw C A; X Ae 
My x M; Cc Mi3 eg Ay x Ao x As 


ee 


My... i-1 x M; Cc M,z... ; C Ay x As x aii x Aj. 


10 W. Krull, Idealtheorie in unendlichen Zahlkérpern, Math. Zeitschrift, vol. 29 (1929), 
pp. 12-54. 
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We denote by My ... ; the uniquely determined" maximal order of Ay X Az X --- 
x A; which contains the order My... ;1 X M;. All these hypercomplex orders 
have the highest possible rank. 

We proceed to 

THeoreM 10. The sequence of maximal orders {My... ;} defines a maximal 
order J of A. 

Proof. Let us assume that J is not a maximal order. Then J is contained 
in at least one maximal order J’. Hence there must exist an element a of J’ 
which does not lie in J. Since a is an element of a maximal order J’, its minimal 
equation has integer coefficients ink. It must belong to a finite subalgebra of A. 
Let A; X As X --: X A, be this algebra. Then M’ = {J,a}M A, X AoX -:- 
x A, DM,...,. This means that in the finite normal algebra A; X Az X -:: 
x A, there must exist an order M’ properly containing the maximal order 
M;...,, but this is clearly impossible. 

THEOREM 11. There exist continuously many maximal orders in the algebra A. 

Proof. This follows immediately by considering the construction of the 
special maximal order J. At each step we can choose infinitely many maximal 
orders M; in the algebras A; to form the maximal orders My ... ; of an approxima- 
tion. The maximal orders My»... ; are different for different M;,. To prove 
this let M; and M; be two different maximal orders of A;. Assume that the 
uniquely determined maximal orders My... ; and Mjg...; in Ay X «+: X A; 
which imbed M;_, are equal. Then the intersections My... ; MA, X --- XK Ai 
and Miz...; M Ai X +++ K Ay are equal. This intersection is obviously an 
order of Ay X-+-- X Ajy. It contains the two different maximal orders 
M; and M;. It therefore is equal to both, in contradiction to the assumption. 
This construction leads then to N.“o = S& different maximal orders of A. 

Now we consider the decomposition of prime ideals p of the center k. It 
suffices to consider the p-adic extensions A, of the given totally normal algebra 
A.” We understand that the extension Ap is the product modulus A -ky. 
Obviously A, is a totally normal simple algebra with the center kp.” In the 
representation (A;)p X --- X (Ai)p X --: of Ap as a direct product, we can 
combine all finite local division algebras (Dp); of (A,)p into a local division 
algebra Dp» which is also totally normal over kp. 

This reduces the problem of decomposition to the two partial problems: (1) 
arithmetic in a totally normal division algebra Dp over kp, (2) arithmetic in 
systems of matrices over such a division algebra. We prove 

THEOREM 12. A totally normal division algebra Dy» over a p-adic number field 
kp contains exactly one maximal order Op», and there exists one prime ideal P in Dp». 


" T. Nakayama, Uber das Produkt zweier einfachen Algebren mit zu einander teilerfremden 
p-Indizen, Jap. Journal of Math., vol. 12 (1935). 

12 EF. Noether, Zerfallende verschrénkie Produkte und ihre Mazimalordnungen, Actualités 
Scientifiques et Industrielles, 1934. 

13 Consider the approximation by products of the (A;)p. 
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All other ideals are two-sided and they are determined by their values with respect 
to P and their symbols. 

Proof. Let --- & (Ds)in & (De); & --- be a defining sequence of finite 
p-adic division algebras of degrees n; over kp. These algebras contain exactly 
one maximal order (Op); with one prime ideal P;. The value with respect to 
P; may be denoted by vp,."* The sum > (Op); is obviously a maximal order 
O» of the algebra Dp. By definition, Op contains all elements of Dp whose 
minimal equation has integer coefficients in the field kp and the highest coefficient 
is one. Let O} be another maximal order of Dp. All its elements satisfy 
minimal equations with integer coefficients in kp. Therefore 0; Cc O,p. But 
O; was assumed to be a maximal order. Hence O; = Op. 

The sequence --- © P,_, © P; € --- of prime ideals defines the prime ideal P 
of the maximal order Op. Let a be an ideal in Op. Its defining sequence 
\a; = a 1 (Op);} of ideals in (Op); consists of two-sided ideals. Hence a itself 
is two-sided. The value vp with respect to the prime ideal P is defined by vp(a) 
= lim vp,(a)-n;", if a lies in the algebra (D»);, and not in (Dp);,1. It is vp(a) = 


i~e2 


Up) (a) “ni, - The value vp(a) of an ideal a is defined by lim inf vp(a). The 


a¢ea 
symbol s(a) is equal to f (finite) if and only if there exists an element a in a 
such that vp(a) = vp(a). The symbol s(a) = 7 (infinite) in all other cases. 
Then all ideals of Op are uniquely determined by their values and symbols.” 
This theory runs exactly as in the commutative case. 
If we consider a finite matric algebra (Dp), over Dp, then (Op), is a maximal 
order of (Dp) m. 
THEOREM 13. The two-sided prime ideal P of (Op) is given by 
Pp... p\™ 
P(O,)m = 
p...p 
and it is the common part of m left ideals L; which are given in the form 


O, --- P--- 0,\™ 
L; = 


’ 


O,:::P--- 0, 


the P’s being in column j. 

Proof. These facts are quite obvious.” 

Now we consider the product Ap» of infinitely many matric algebras (kp),., 
with the totally normal division algebra Dp». 


14 For the notion of value, etc., in division algebras see H. Hasse, Uber p-adische Schief- 
kérper und ihre Bedeutung fiir die Arithmetik hyperkomplexer Zahlen, Math. Annalen, vol. 
104 (1930), pp. 495-534. See this paper also for the proofs of the following theorems. 

16 Cf. footnote 10. The algebra Dp is by no means perfect with respect to the valuation 
. = 
16 Cf. footnote 14. 





310 OTTO F. G. SCHILLING 


THEeoreM 14. The two-sided prime ideal P of the maximal order Op X TI(0p) ». 
= J is divisible by continuously many left ideals in J. (op is the maximal order 
of kp .) 


Proof. The series op € --» & Op X I (Op); & Op X II (Op) m; 


j7=1 
is a defining sequence for a maximal ae a J of Ap. The ideals P x Op X 


Il (Op); are two-sided prime ideals of the maximal orders Op X I (Op) m,; - 
j=1 


They define the tw o-sided prime ideal P of J. Let L, be one of the le it divisors 


of P K Op X I (Op)m; in Op X a (Op);- Their number is surely greater 


than one. Then all of these ideals L, ;-. are divisible by more than one left ideal 

L; of Op x I] (Op) m;- This construction leads again to at least 2%. = & left 
7=1 

divisors of P. 

Remark. A combination of these results with the explicit construction of 
finite division algebras shows that there exist totally normal division algebras 
in which exactly two prime ideals p and q are totally ramified, i.e., Dp and D, are 
both infinite division algebras. There also exist totally normal division alge- 
bras, all p-adic extensions of which are equivalent to the product of a finite 
division algebra with an infinite product of matric algebras. 


Tue INSTITUTE FOR ADVANCED StTupy. 





AN EXTENDED ARITHMETIC 
By Garrett BiRKHOFF 


1. Introduction. In this paper there are defined three combinatorial opera- 
tions upon partially ordered sets X and Y resulting in partially ordered sets 
which will be denoted by X + Y, XY, and X” respectively. 

In the case where X and Y are finite unordered sets with cardinal numbers 
m and n respectively, the operations yield the finite unordered sets of cardinal 
numbers m + n, mn, and m” in the commonplace sense. In the more general 
case where the requirement of finiteness is dropped, they yield the usual founda- 
tions for the arithmetic of general cardinal numbers.’ 

It will be proved that the formal properties of general cardinal arithmetic’ 
persist as laws of composition for our extended arithmetic of general partially 
ordered sets. On the other hand, no algorithm for well-ordering the class of 
partially ordered systems is given, and so the theorem of transfinite arithmetic 
which asserts that the cardinal numbers are well-ordered has no analogue. 

It will also be shown that our extended arithmetic is of considerable use in 
describing algebraic systems. Thus although it differs from Hausdorff’s ordinal 
arithmetic when applied to sequences, it is apparently more consequential than 
the latter. 


2. The extended arithmetic. The extended arithmetic which is proposed will 
be described by defining first the domain of elements to which its operations 
apply, and then defining the resultants of its operations. 

The elements are partially ordered systems in the usual sense of Hausdorff— 
that is, systems X, Y, Z, --- whose members (denoted by small Latin letters) 
are related by an inclusion relation x < 2’ satisfying 


Pl: « S x. (Reflexiveness) 
P2: « S 2’ and2’ S zimply z = 2’. (Anti-symmetry) 
P3: 2 S$ 2’ and2’ S 2” implyz S< x”. (Transitivity) 


Two partially ordered systems X and Y will be called isomorphic (written 


Received May 22, 1936; in revised form, December 31, 1936. 

1 Cf. F. Hausdorff’s Mengenlehre, Berlin, 1927, p. 29, p. 62. 

? The possibility of such relations as z + 1 = z, 2x = z, and z* = z distinguishes general 
from finite cardinal! arithmetic. 

’ Although transfinite numbers haye numerous uses, the only constructions of sums, 
products, or powers of transfinite numbers which have been really interesting hitherto 
have been those of (1) the power of the continuum as 2No (No denotes countable infinity), 
and (2) transfinite ordinals such as w + 1, 3w, or w? by ordinal addition and multiplication, 
and only the second of these is lost in our extended arithmetic. 
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X = Y in this article) if and only if there exists a one-one correspondence 
between their elements which preserves inclusion. 

By the swum X + Y of two such systems X and Y is meant the system whose 
members include both the members x of X and the members y of Y, in which 
x 2x'andy S zy’ keep their meaning, while x S y and y S 2 are always denied. 

By the product X-Y of X and Y is meant the system whose members are 
all couples [z, y] with « «eX and y ¢ Y, and in which [z, y] S [2’, y’] means 
thatz S 2’ inX andy S y'inY. 

By the power X” of one such system X with respect to a second such system 
Y as exponent is meant the system whose members are the monotonic functions 
f(y) with domain Y and range in X (that is, all functions such that y S y’ in Y 
implies f(y) S f(y’) in X), ordered by having f S g mean that f(y) < g(y) 
for all y. 

The reader will have no difficulty in verifying that X + Y, X-Y and X” 
are partially ordered systems. Further, if X and Y are lattices,‘ then so is X-Y. 
Moreover any laws such as the modular and distributive laws which hold in 
X and Y hold in X-Y. While if X is a lattice and n is the power of Y, then 
X” is a sublattice of X", and so X” is a modular resp. distributive lattice if X is. 

The proofs of these facts will be omitted. Also, we shall not prove that the 
above definitions yield an extension of the cardinal arithmetic of Hausdorff— 
this is evident if one looks at Hausdorff’s definitions (loc. cit.). 


3. Applications. It is interesting to consider various arithmetic combina- 
tions of especially simple partially ordered systems, which have an independent 
algebraic importance. 

As regards the simple systems, we shall let n denote the unordered aggregate 
of n elements, C, the sequence of n elements (n finite), symbols Nq the trans- 
finite cardinals (= unordered aggregates), and adopt the conventional notation 
for transfinite ordinals. Finally, we shall let B = C2 denote the Boolean 
algebra of two elements, and P,, the one-dimensional projective geometry with n 
points on its line. Then 


(3.1) The finite Boolean algebras are the B”. 

(3.2) The Boolean algebra of all subsets of any aggregate of power S is BY. 
(3.3) The finite distributive lattices are the B“, where X denotes a variable 
finite partially ordered set. 

(3.4) The “quotient-lattice” associated by Ore’ with each abstract lattice L 
is L”. 


4 By a “‘lattice’’ is meant a partially ordered system LZ in which any two elements x 
and z’ have ag.l.b. x M x’ such that 2’’ < +f x’ means that 2’’ S$ zand 2” < 2’, and 
al.u.b. « U 2’ such that z’’ = 2U 2’ means that 2’’ > zand 2’ = zx’. By the modular 
law is meant the law that z S x’’ implies z U (x! N 2’’) = (x U 2’) N x”; the distributive 
law asserts that (2 U x’) N (2’ U 2") N (2"" Uz) = (x Nw’) U (2? N 2’) U (2 N 2). 

5 On the foundations of abstract algebra. I, Annals of Math., vol. 36 (1935), p. 425. Ore 
calls lattices structures. 
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(3.5) The integers ordered with respect to divisibility (the relation m | n) are a 
sublattice of w™°. 

(3.6) The “free” Boolean algebra generated by n symbols is B”. 

(3.7) The “free” distributive lattice generated by n symbols and with 0 and J 
added, is B””. 

(3.8) The “free’’ modular lattice generated by three symbols is a sublattice 
of B -P; ° 

(3.9) The most general configuration generated in n-space by an r-plane and 
an s-plane through the origin, after iterated sections, linear sums and taking 
of orthogonal complements, is B*:P,. 


The proofs of (3.1)-(3.9) are various. Assertions (3.1), (3.2) and (3.6) are 
known.” <Assertions (3.3) and (3.7) are proved in the author’s article, Rings 
of sets, which will appear in the next issue of this journal. Inspection of Ore’s 
definition, which is identical with our definition of L”, yields (3.4). Statement 
(3.5) is a corollary of the unique representation of any integer as a product 
of powers of ascending primes. The result (3.8) has been proved by the author 
(On the structure of abstract algebras, Proc. Camb. Phil. Soc., vol. 31 (1935), p. 
443, Theorem 14), while (3.9) has been recently proved by J. von Neumann; 
the proof will be published elsewhere. 


4. Arithmetic identities. It is obvious that addition and multiplication are 
commutative—in symbols, that 


(4.1) Xx+Y=Y+X and X-Y=Y-X. 
They are also associative—that is, 
(442) X+(¥+2Z)=(X+Y)+Z and X-(¥-Z) = (X-Y)-Z. 


For both (X + Y) + Z and X + (Y + Z) consist of all members of either 
X, Y, or Z, with the provision that x < 2’, y S y’, andz S 2’ preserve their 
meaning, while x < y,z 2y,y Sz,y 22,2 S x, andz = 2 are always denied. 
And both X-(Y-Z) and (X-Y)-Z consist of all triples [x, y, z] with xX, 
yeY and zeZ, where [z, y, z] S [2’, y’, 2] means that x < 2’, y S 2’ and 
zs 2’. Again 


(4.3) X-(¥+Z)=X-Y+X-Z and (X + Y)-Z=X-Z24+ Y-Z. 


(In words, multiplication is distributive with respect to addition.) For 
X-(Y + Z) and X-Y + X-Z alike consist of all couples [x, y] and [x, z] with 
reX, ye Y, and ze Z, where [z, y] S [z’, y’] means that z S 2’ andy S y’, 
(x, 2] < [z’, z’] means that x S 2’ andz S 2’, while [z, y] < [2’, z] and [z, y] = 
[x’, z] are always denied. This proves right-distributivity; left-distributivity 
follows by commutativity. 

® For instance (3.1) is proved as Theorem 25.1 of the author’s On the combination of 
subalgebras, Proc. Camb. Phil. Soc., vol. 29 (1933), p. 460; (3.6) follows from this and 
E. Schréder’s Algebra der Logik; (3.2) is immediately obvious. 
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Again, although exponentiation is not commutative—in general, XY = Y*— 
it satisfies the usual identities 


(4.4) X’X? = X"*¥) and = X*Y* = (XY), 
(4.5) X'=X and = (X’)* = X"?. 


The first identity of (4.4) is easy to prove. The system X”X7’ consists of all 
couples [f, g] where f and g have Y resp. Z for domains, and so are equivalent 
(Y and Z being non-overlapping in Y + Z) to a single function h with domain 
Y + Z. Moreover [f, g] < [f’, g’] means that for all ye Y and ze Z, f(y) S 
S’(y) and g(z) S g’(z)—that is, that for allwe(Y + Z), h(u) S h’(u). Hence 
Ta =e. 

Again, X”Y% is the set of all function-couples [f, g], where f and g are from 
Z to X resp. Z to Y, and [f, g] = [f’, g’] means that f(z) S f’(z) and g(z) S g’(z) 
for all z. But each such [f, g] can be regarded as a function A carrying each 
z «Z into [f(z), g(z)] « XY—and so, since h S h’ if and only if [f, g] s [f’, 9’, 
X’Y’ = (XY)’. 

That X' = X is obvious; it is a corollary (using (4.4)) that X° = XX, 
X* = XXX, --- , and that X"X" = X"*". 

Actually, the second half of (4.5) is not simple to prove, and we shall start 
by analyzing X"’. By definition, this consists of all monotonic functions f 
with arguments [y, z] and values z [x eX, ye Y, z¢€Z]. Such functions f asso- 
ciate with each z«Z, a function g, with arguments y « Y and values g.(y) = 
Sify, z]) in X. But for any z, y S y’ implies [y, z] S [y’, z] in YZ, hence 


g:(y) = fily, zl) S f(ly’, 2]) = g-(y’) 





and so g; is in X”. 
Furthermore, if z < 2’ and y is fixed, then 


gy) = f(ly, zl) S f(y, 2’) = gy), 


and so the correspondence f: z — g: is monotonic, which makes X”’ a subset 
S of (X")*. While f < f’ if and only if f({y, z]) < f’((y, 2]) for all [y, z]—which 
means that for all z, g-(y) < g:(y) for all y, which means that for all z, g. < 9: 
and so X*’ is isomorphic with S. But conversely, given f «(X”)’, if y S y’ 
and z < 2’, then g.(y) S gs(y) S g(y’), and so f « X””, which completes the 
proof of the fact that X"% and (X")’ are isomorphic. 


5. Decomposition theorems. It is evident that if X = X,; +---+ X, = 
Y, +.---+ Y, is any partially ordered system, written in two ways as the 
sum of additive components, then by set-theory X = Zu + --- + Z,., where 
Z;; = X; | Y; denotes the set of elements common to X; and Y;, ordered as 
in X. For if x¢Z,;; and x S 2’, then 2’ e X; and x’ « Y;, whence 2’ € Z;;. 

It follows that if X is finite, it has one and only one representation X = 
NX, + --- + X, as the sum of (non-void) partially ordered sets not themselves 
further reducible. Actually, if we regard the elements of X as vertices of a 





i 
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graph, and join two vertices x and zx’ when x < 2’ or x > 2’ and no element z 
with x < z< 2’ orz > z > 2’ can be interpolated between z and 2’, then the 
irreducible additive components of X are the connected components of its graph. 

The author has proved elsewhere’ that any finite lattice L has a similar unique 
representation L = Ll, --- L, as the product of constituents not themselves 
further reducible. The prime-factor theorem of arithmetic asserts that the 
same conclusion is valid if Z is a totally unordered aggregate. 

Can we unite both these results in the single assertion that each finite partially 
ordered system X can be written in one and only one way as a product X, --- X, 
of “prime” (= indecomposable) factors? It is quite possible. 


6. Solution of equations. One naturally asks if it is true that in the finite 
ease, the rules that 
(6.1) X+Y=X+2Z implies Y = Z, 
(6.2) X-Y=X-Z implies Y = Z, 


which are true for pure cardinals and ordinals, are valid for general partially 
ordered sets. 

It is easy to prove (6.1) by appealing to the result that every finite partially 
ordered system X has a unique decomposition into a sum of irreducible com- 
ponents. For, making this decomposition, we have 


X+Y = (Xi+--- +X) + (Mi t+--- + ¥,), 
X+Z = (Xi+--- + Xq) + (41 +--+ + Z,). 


One sees that by ordinary subtraction, each kind of component must occur 
in Y and in Z alike, the number of times it occurs in XY + Y = X + Z minus 
the number of times it occurs in X, whence Y = Z. 

This argument would prove (6.2) similarly if we knew that the proposition 
stated at the end of §5 was true; in any event, it holds for lattices, since they 
have unique decompositions into irreducible multiplicative factors. 

If it held in general, would it imply that one could, by introducing negatives 
and quotients as ideal elements, extend our arithmetic to form an abstract 
field of elements of the symbolic form (X — Y)/Z? This seems highly 
improbable. 


7. Lexicographic combinations. Let X be any partially ordered system 
whose members are themselves (not necessarily distinct) partially ordered 
systems Y,. 

We shall define >> Y, (in words, the lexicographic sum of the Y, relative to 

x 


the ordering X) as the system whose members are the y, « Y, [x « X], in which 
yz S yz. preserves its meaning in Y,, and yz S y. [7 % 2’] means that x < 2’. 


* On the lattice theory of ideals, Bull. Am. Math. Soc., vol. 40 (1934), p. 616, Theorem 2 
and its corollaries. 
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Similarly, we shall define [] Y, (in words, the lexicographic product of the 
x 


Y, relative to the ordering X) as the system whose members are the functions f 
carrying each x « X into an f(x) « Y, , and in which f S g means that for every 2 , 
either f(z) S< g(xo) or there exists an Z < 2» such that f(%) < g(%) and f(x) S 
g(x) for alla < &%. 
In case X is totally unordered >> Y, and [J Y, are the cardinal sums and 
x x 


products of Hausdorff extended as in §3. In case the Y, are sequences arranged 
in the sequential order X, > Y,. and II Y, are the ordinal sums and products 
x x 


of Hausdorff (loc. cit.). 

If all the Y, are isomorphic—that is, letting them all be a particular partially 
ordered system Y—we get a new binary operation of exponential type. For 
although the lexicographic sum )> Y of the occurrences of Y relative to the 


x 
ordering X is simply the lexicographic product I] X-Y of X and Y relative 
x<Y 
to the ordering X < Y, the lexicographic product II Y, which we may denote 
x 


lex Y*, is a new system which could not easily be defined otherwise. Its ele- 
ments are the different “words” f with letters from X, one in each position of Y, 
and ordered in lexicographic order. 

Lexicographic combination is non-commutative, and seems to have no interest 
aside from the fact that lexicographic combinations of simply ordered (or well- 
ordered) systems are always themselves simply ordered (resp. well-ordered). 
This property makes them useful in ordering words in dictionaries. 


HARVARD UNIVERSITY. 














THE IMBEDDING OF RIEMANN SPACES IN THE LARGE 
By Cart B. ALLENDOERFER 


1. The ‘“‘classical”’ theory which describes the properties of an n-dimensional 
Riemann space immersed in an (n + p)-dimensional Euclidean space has its 
roots in the work of Voss and Ricci,’ and is a natural generalization of the 
properties of a two-dimensional surface in ordinary three-space. 

A more recent theory is that of W. Mayer, the latest refinement of which 
appeared in the Transactions of the American Mathematical Society in 1935 
(vol. 38, pp. 267-309). We shall refer to this paper as “M”. This theory com- 
bines the generalization of the properties of an ordinary surface with the general- 
ization of those of a curve in an n-dimensional Euclidean space. Thus the theory 
speaks of the “first normal space” of an n-dimensional Riemann space as an 
analogue of the first (‘principal’) normal of a curve. Second and higher 
normal spaces also occur as extensions of the notions of the second and higher 
order normals of a curve. Associated with each normal space is a fundamental 
form, the form for the r-th normal space being of the 2(r + 1)-th degree. These 
forms give a complete description of the geometry of the normal spaces. Mayer 
has shown that the coefficients of these forms are not entirely independent, 
but satisfy certain relations. The first object of this paper is to prove some 
additional relations of this nature and to determine a set of them which is the 
necessary and sufficient condition on a number of forms that they describe a 
Riemann space which is actually imbedded in a Euclidean space. These results 
are stated in Theorem VII. In the course of the argument (§5) we prove a new 
set of identities in the curvature tensors of order higher than the first which are 
analogous to those for the classical Riemann tensor. 

The previous treatment of Mayer’s theory had the further defect that it was 
purely local, referring to an unspecified domain about a suitable point. In this 
paper the differential equations which occur are investigated with the purpose 
of finding the maximum region within which they may be integrated. We 
define singular points and show that the results of Theorem VII hold for any 
simply connected portion of the space which does not contain a singular point. 


Received December 1, 1936. 

1A. Voss, Zur Theorie der Transformation quadratischer Differentialausdriicke und der 
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G. Ricci, Formole fondamentali nella teoria generale di varieta e della loro curvatura, Rend. 
dei Lincei, vol. 11 (1902), pp. 355-362. 

A modern exposition of the theory is given by Eisenhart, Riemannian Geometry, 1925, 
Chapter IV. 


317 














318 CARL B. ALLENDOERFER 


“1° ° es ] r a 2 
This treatment is in the spirit of recent work by T. Y. Thomas and W. Mayer 
in similar connections, and many of the methods used here are due to those 
writers. 


2. Riemann manifolds. We shall consider a Riemann manifold, R,, i.e., a 
coérdinate manifold endowed with the Riemann metric 
(2.1) ds” = gagdx* dz", 


where the ga3(2) are continuous functions having continuous derivatives to a 
definite order. This order will be specified in the next section. We assume 
that the determinant | g.3| is positive definite. Let y' (¢ = 1,---,n + p) 
be the rectanguiar Cartesian coérdinates of an (n + p)-dimensional Euclidean 
space E,,,. Then R, is said to be imbedded in E,,,, if the equations 


(2.2) a 2. 


have a solution 
(2.3) y' = ¢'(2) 


in each coérdinate neighborhood of R,. We shall understand hereafter that 
the codrdinate system z is a typical coérdinate system, not some special one, 
and consequently an invariant equation such as (2.2) will be understood to hold 
in all coérdinate neighborhoods without further mention of this fact. 


3. The normal spaces. For each value of a and for linear transformations 
a 4 
of the y’s, the quantities oe are the components of a vector field defined for all 
a di , ay’ 
values of x CU. The totality of vector fields defined by o* ie (where o* 
ze 


are arbitrary scalars in E,,,,) is called the first osculating space J;. In a similar 
manner, the set of all vector fields defined by 


; 2a) 2 
(3.1) x (0 oy + gm F fee pe gtien ms gate) 





ox*! Ox! Ox? Ox! +--+ xr 


(where again the o’s are scalars) is called the A-th osculating space J2..., . 
The invariance of these spaces under transformations of the 2’s is proved in 
M, p. 270. 

The totality of vector fields belonging to J2..., and normal to Jj2...,-1 is 
‘alled the (hk — 1)-th normal space, J,. We shall denote the projection of the 
vector field 


a y' 


Oxr% +++ arr 


2T. Y. Thomas, Riemann spaces of class one and their characterization, Acta Mathe- 
matica, vol. 67 (1936), pp. 169-211; Fields of parallel vectors in the large, Compositio Math- 
ematica, vol. 3 (1936), pp. 453-468. 
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into the normal space J, by Yo, ...«,. Since the vector fields belonging to each 
normal space are normal to those belonging to every other normal space, we 
have 


(3.2) a (h # k). 


Since there are at most p independent normal vectors, it is clear that there 
are at most (m — 1) S pnormal spaces. It is therefore of significance to speak 
of the “last” normal space, J,, . The index m will have this significance through- 
out the paper. 

Let us now assume that the functions ¢'(x) occurring in (2.3) are of class C"™', 
that is, they have continuous derivatives of the (m + 1)-th order. The co- 
efficients of the first fundamental form g.3 given by (2.4) are therefore of class 
C™. We shall also denote them by Eas(= gas). The coefficients of the A-th 
fundamental form are defined by 


(3.3) | a) eer = I a 


for h = 1,---,m. From their definitions it is clear that Ea, ... a4) 8, ...3, 
are of class C"”"**. They are also tensors under transformations of the z’s, 
for it follows from their law of transformation that Y4,...., are tensors. Be- 
cause of (3.3) the matrix’ 


E, = || E,, +++ an 1B, +++ Ba || 
is positive semi-definite and its rank is the same as that of the matrix 
VY, = | Yi, -.0s ll 


where z denotes the rows and combinations of a; --- a, denote the columns. 
In fact (ef. M, p. 274) the equations 


(3.4) Ratha" " =@ 
and 
(3.5) Y5, ...5,0%°°° ® = 0 


have the same solutions at any point P. 

A point P C R, will be called a regular point of the imbedding if there exists 
a neighborhood N(P) > P within which each Y, is of constant rank. This 
implies that each E, is of constant rank in N(P), and conversely. A point 


which is not regular will be called singular. Consider now the matrix 
| Eas | ! 

| Sa 
pa) (Eatin \. 


| 





| | 
| _ ae ae 

®’ The rows are given by combinations of the indices a --- a, , and the columns are 
given by combinations df 6; --- Ba. 
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It is clear that E is of constant rank in N(P). Suppose, conversely, that in a 
neighborhood M(P) the matrix EF is of constant rank. Now there exists a 
neighborhood M(P) C M(P) within which each E, has a rank not lower than 
its rank at P. But since the rank of Z is the sum of the ranks of FE, , and since 
the rank of E is constant in M(P), it follows that each E, is of constant rank 
in M(P), ie., P is a regular point. Hence we may make the equivalent 

Derinition. A point P C R, is regular if there exists a neighborhood N(P) 
within which the matrix E is of constant rank. Other points are called singular 
points. 

It will become evident in the next section that the differential equations we 
consider leave the rank of Z constant in any domain in which they can be inte- 
grated. Hence we see that our singular points are also singularities of the 
system of differential equations. We shall show them to be the only such 
points. 

It is evident at once that the set of singular points is nowhere dense in R, 
and is a variety of dimension lower than n. Hence there will exist a neighbor- 
hood of every regular point which contains only regular points, and in fact this 
neighborhood can be taken to be simply connected. We shall call such a domain 
the neighborhood U, and shall limit our discussion to regions of this nature. 


4. The Frenet equations. It can be proved (M, p. 278) that at any point 
P CU there exist T’s which satisfy the Frenet equations 





i (y’) Yu 
C) ‘ 
ax”? (Ya,) Fae Y5, + a 
(4.1) oe on | | 
arent (Ya,.--as) = 2. to Oe *Ba-1 + -. eins Y3,-+-8, + Vas---eati> 
a ri ) 31°**Bm—1 31° ri 
axons (} ay***ay,]? = Pay: +:am+i *Bm—1 + ae ao Vy-+-Bns 


From (3.2), (3.3) and (4.1) it follows that for every point we have 


7 1 i) ri ri 
(4.2) Ez,.. --+Balyis<*%h ile or => le ee y yi'''%h 
and 
++ Bh a i ri 
(4.3) Ep,..-Beesivi---m01 a-e a EB idl Y yie' Yama 


On account of (3.2) we may write (4.3) in the form 


, ++ Ba ri d fyi 
(4.4) EE TRO -_ —Voy--an 55 (Var ra-a): 
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Since by hypothesis the right hand sides of (4.2) and of (4.4) considered as 
functions of the 2’s are of class C” and C”™*** respectively, we have the fact 
that the left sides have this same property. 

Since we have assumed that each matrix E, is of constant rank r, in U, to 
every point P C U there corresponds at least one non-vanishing r,-th order 
minor, D, , of each matrix E,. Because of the continuity of the E’s there exists 
a neighborhood V(P) > P which is contained in U and within which no D, 
vanishes. 

Now we know that the I’s actually exist, and so it is possible to solve (4.2) 
and (4.4) for the I’s as functions of the z’s within each V(P). The solutions 
in general will be determined to within additive functions 6°!***** and @°!°**4-*, 
respectively, which are solutions of 


(4.5) E4,..-batvi---v or = 0, EB p,.--Beeslva---1a=1 aliieiaiie = 0. 


The 6’s may be chosen so that 13'.7%4,, and 2'!>' are of class C"™” and 
c”"* respectively in V(P). 

If r2""78. are solutions of (4.2) in V(P) and if 1’'""°%,, are solutions in 
another neighborhood V(P’), it is clear that in V(P) n Vv (P’) 


8,°°*B 8 
Ferd 8 anon a es | 


A similar result holds for '’%!'""%). Mayer has proved (M, p. 379) that for 
transformations of the 2’s, 9 ie are tensors, and that lito ra,r have a 
transformation law like that of the Christoffel symbols. 


If we form the integrability conditions of (4.1) in any V(P), we obtain 








PL" *Ph—1 POI’ *oh—-2 WPL" *Ph—1 PF *oh—2 i _ 
(4.6) OR ee Weems ~~ Be eee Pee Fae OO 
o1°**Oh—1 VOL Oh—1 
ODay -+-aqr a al ai" "ane + re. “Ph pets: “h— 1 ea "Ph 1 ven" * “eh 1 
— — — at -10 ay Pi’ *Ph— 
». Ox? Ox’ “e “Pho 1 ape 1 A~i 
(4.7) 


\Pi"* "Ph 1°" *Oh—1 pi’ 1° ""eh—-1 | yt . 
+ Oe ear Tote one) — Ty: “ene Tone oar yin = 0; 





OT ay--aar _ OV ay --- ane ae 
( a sizes a ao Pe fear Ugh: oe at hy > eee F 
(4.8) 


+ TG). Sar 8S — TG,---age Oe + TG). -eaee — TG: adn = 0, 
or 
ease + Fk tie ~ Fe agree Oh 
where B’s are thus defined. When h = m we have 
(4.9) ae laren OO 
Finally we have 


OID) Cie — Te eee ht + FS eaneve — TR ered Faye oO. 
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In these equations the index h has the range 1, --- , m, it being understood 
that no terms involving zero or negative indices are written. The entire set 
may be written in a more convenient notation by writing s, for the set of h 
symmetric indices 0; --- o,. A single latin index without e subscript stands 
for a single greek index. Thus, for example, we write T@,..2,, = Tats. Here- 
after we shall pass from one notation to the other w ithout comment. If (4.6) 
to (4.10) be multiplied by appropriate Y’s and the summations performed (for 
example multiply (4.6) by Y%,...s,-,), the set can be written in the shorter 
notation as follows: 





(4.11) (Page Prpriw — Voge’ Pratt) Boy—2ime—2 = 05 
T.;' Was rh=1 8 ra~t qta~t 
( si ME Pe Pte — Pete! Pic 
(4.12) 
+ Pant io me: Tae Tits") Baste = 0; 
(4.13) (Bariw + Fave —_ Tore) Bay ids = 0 (h = 1, itch ,™ foe: 1); 
(4.14) Bor tw E...\bm = 0; 
(4.15) (Pagrie Se — Pear 8s + Tegrite — Toj-iee) By-.r1m, = 0. 


If we differentiate (3.2) and (3.3), substitute from (4.1), and again make use of 
(3.2) and (3.3), we have forh = 1, ---,m 


(4.16) Eay—yt}0s + ra; Ba-sja-1 ed 0; 

7 te] 
(4.17) a (Ea,\v,) — Tere Ean, — ltt, Faia, = 0. 
The entire set (4.11) to (4.17) hold in every V(P). 


Turorem I. If y' = ¢'(x) are functions of class C”*’ which define a Riemann 
space imbedded in a given E,,,, throughout a neighborhood U,, then 


1. The tensors Eo,...a,)8;---8, (k = 1,--+,m) are defined as functions of 
class C"*** in U; the matrix E, is positive definite in U, and the matrices E; (h = 
2,---,m) are positive semi-definite of constant rank r, in U and rz + +++ + Tm 


2. “The quantities re ”"2—1(g) and 1%!""78 (x) are defined to within a solution 
of (4. 5) in each V(P) CU and are of pre c”**" and C”"™, respectively. 

3. The E’s, the Y’s, and their derivatives satisfy (4.11) to (4.17) in each neighbor- 
hood V(P). 


5. Some identities and formal relations. We interrupt the geometrical argu- 
ment at this point to consider some formal relations which we shall need later. 
The expressions 


Batrritw = By tw Esriag (h = 1, ---,m) 
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are called the components of the h-th curvature tensor, the first curvature tensor 
being in fact the ordinary Riemann tensor. From (4.13) and (4.16) it follows 
that 


(5.1) Baxi pp itw = Eazwi pat -_ a,t| pyw (k = 1, 7: a = 1). 


These equations are generalizations of the Gauss equations of the classical 
theory, so we shall continue to call them the Gauss equations. Since the right 
sides of (5.1) are of class C”™, it follows that Ba,)p,j: are of class C™™*. In 
fact, we see that the curvature tensors are defined as functions of this class 
throughout U in spite of the fact that the I'’s from which they are formed are 
defined only within each V(P) C U. It is also clear that the arbitrariness 
which occurred in the definition of the I'’s has disappeared from the B’s. Be- 
cause of (5.1) there exists a set of relations in the B’s which holds for all z C U 
and which is a generalization of that which holds for the Riemann tensor. By 
examining the definition of the B’s, it can be shown that these relations are 
actually identities in the z’s. They are the following: 


(5.2) Bayt ppitw + Bagi py iwt = 0; 

(5.3) Bazi ppitw + Boy ia; \tw = 0; 

(5.4) Bayt r.-yvltw + Baxipp-swivt + Bagipr-tiwv = 0; 

(5.5) PUR, cite tens) = 0, 

where P denotes the sum of terms formed by the cyclic interchange of (a; --- ax 
6, --+ Ber). We indicate by a bar over an index that it is not to be included 
in the permutation. The final identity is 

(5.6) S(Ba,---az| 3; ---Belre) = 0, 


where S indicates the sum of k + 1 terms, of which the first is the term written 
above, and the q-th of which is obtained from the (q — 1)-th by interchanging 
a@,-1 With the last index of the (q — 1)-th term and §,_, with the next to last 
index of the same term, and then interchanging a,_; and 8,_; in the result. In 
case k = 1 this becomes the ordinary identity 


Bajajrw + Bajrjas = 0. 


For k = 2 we have 


Baya\ 8; 8:\te + Basie sgia, 3, + Busyjrayiag8. = 9. 


In order to generalize the Bianchi identity we define a generalized covariant 
derivative which will be denoted by a semi-colon and which is a special case of 
those considered by A. W. Tucker. The rule to be used is shown by the follow- 


* A.W. Tucker, On generalized covariant differentiation, Annals of Math., vol. 32 (1931), 
pp. 451-460. 
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ing example, where t; and r, are compound indices of the same or different orders: 


an 


ty — 
(5.7) Tike = = 


q t t P 
+ Tx Poe — i A iu . 
The ordinary rules of covariant differentiation apply in this case, and for exam- 
ple we may write (4.17) in the form 
Bray onit = 0. 


When A = 1 and f = 1, we have ordinary covariant differentiation with respect 
to I's,; this will be denoted by a comma. The generalized identity may be proved 
by direct calculation from (4.11)-(4.17) and has the form 


(5.8) Bagipaitw;0 + Bag\ vy ivtzw + Bayi vpiwr3t = 0. 


If (5.6) is differentiated covariantly and each term in the result is replaced 
by the corresponding two terms given by (5.8), we have 


(5.9) T(Bay.--ay\ 81 -+-Bx\7030) = 9, 
where I denotes the sum of terms obtained by the cyclic interchange of 
(ay -** M& 6B; eee Bnw). 
By differentiating (5.5) we have 

P(Ba,.--a4 8; +++ Bylrari) = 0, 

the cyclic interchange omitting w and A. Adding to this the identity 
P(Ba,.--ay| 8y---Bslivia) = 0, 

and using (5.8), we have 

(5.10) P(Bay.--ay| 8,-+-Balaizr) = 9. 


This may be thought of as another generalization of Bianchi’s identity. 

It is also of importance to show that (4.12) is a consequence of the other 
equations of the set (4.11)-(4.17). Differentiating (4.16) and eliminating 
derivatives of the E’s by (4.17), we have 

pa 


Th-1 Sh—1 sh—-1 
Bry ian Ox” = —Toiriw (Prite Buy y\on-1 + Tay-ie Ey-s\ra-1) 


8h Sh 
—Ta,-¢9 E.,\h-1 = T's;-,00 Busiea-\t- 


Substituting this into (4.12), we obtain an expression which vanishes identically 
because of (4.15) and (4.16). Thus (4.12) is a consequence of (4.15), (4.16), 
and (4.17). 

A final relation is obtained by substituting in (4.11) from (4.16). The result is 


Th-1 Th-1 
Pont Ey,-\1a,-20 = P paw Ey-y\an-2t = 0. 








»); 
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Because of (4.16) this becomes 
E pxie,-20t Sed E py \ex—2to = 0, 


which vanishes identically by the very definition of the E’s. Hence (4.11) 
is a consequence of (4.16). These last two relations are given by Mayer (M, 
p. 293) but his proof is from a somewhat different point of view. 

Corotuiary. If the matrix E,, is of rank one, the h-th curvature tensor vanishes 
identically. 

For since E, is of rank one at any point P C Ll’, we may write 


Ea,\b, = Aa, Ary - 
Then (5.3) may be written 
(5.11) By tw Ae, Aa, + Bortw Ae, Ap, = O. 
If we put a, = p, , the result is 

- ay ee eg 


But at least one \,, (say A,,) is not zero. Then the corresponding Boh. A., = 0, 
and consequently for any an, Bori.Es,;4, = 0. But from this result and (5.11) 
it follows that 


(5.12) he den Ae, = 0, Bokiw Xo, = 0. 


Consequently all B,,)a,j00 = Oin U. This result has been obtained in a different 
manner by Burstin.* 


6. Determination of the I'’s in terms of the Z’s. We now turn our attention 
to equations (4.11)-(4.17) and deduce from them a set of necessary conditions 
on the coefficients of the fundamental forms. We know that in each V(P) 
there exists a solution of (4.16), i.e., 


(6.1) ae Eat, = — Baytitys; ’ 
where these are considered as linear equations in the I’s. Hence we have the 
Matrix Conpition I. The augmented matrices E, = || Bajja,; Boxes, || 


are of rank r, in U for every choice of t and by. . 

Similar linear equations in T;}, may also be derived. Before proceeding to 
the general case, we illustrate the method by taking h = 1. From (4.17) it 
follows that 


OE a\3 0E3)- OE, « 


ox” Ox* axe * 











(6.2) QV, Ey 6 = 


Since the determinant | E,;,| # 0, these may always be solved for T2,. Thus 
the T'2, turn out to be the Christoffel symbols of the second kind. 


5C. Burstin, Beitrige zur mehrdimensionalen Differentialgeometrie, Monatshefte fiir 
Math. und Physik, vol. 36 (1929), p. 114. 
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For h = 2 we have from (4.17) and (4.15): 
(63) QE opis Vase = Bepiys(Tae ds + T3e6o) + Basiys.e + Easyitea 
+ EBysjeas — Eseias.y — Eeaiay.s = Prsiade, 
where ¢,s\as are thus defined. We know that these equations, considered as 
linear in the [’s, have a solution in every V(P). Thus we have the following 

Matrix Conpition Il. For every choice of (aBe) the augmented matrices 
E; = || Bepiys} Gy8\ase |, are of rank re in every V(P). 

It will be noticed that these conditions are closely related to the Codazzi 
equations of the classical theory. 

In order to extend these results to a general h, we define the operator Q to 
indicate the sum of terms obtained by cyclic interchange of (a; -- + a6: +--+ 8,7), 
a plus sign being attached to a term if the last index in the preceding bracket is 
7 or any a, and a negative sign if it is any 8. Thus the general case of (6.2) 
and (6.3) is 


4 viet yan18 Yis'Ya—1 98 
2E,,. -7y—18131 ++ -Samrd Dag 1¢€T - 2(Paj---an-ir Oe 


58 vt YA—1) OB ay. -ay—r€18;---B4—1 
+ I.,6 Oai-*-apn—1 ) B,,.. *¥a—16/8,---3~— A + o( ant ws 





(6.4) 
, Hi Hh Mit MA—1 
ean Ba, .--aaa10 wren By + Baar = Bay ---na—relty->-Pamrbl ay ---en—10 


, xp , Pp 
= Bay---0a-10 By++-Ba— iA rer — | a Ai-- Baie . i). 
In our briefer notation this may be written 


_ Al sCh-jd Ca- d d Ch-1 
(6.5) 2B ey~.4 ba-ial pi ed Q(T agai 6 pt Ppt 603-1) Bex—idiba—ie + Q(Ea,-svibn-rait)s 
or 
c 
2E cy ids Nae = Pbplagty 


where ¢»,\a,¢ are so defined and we have written c, for cy:d, ete. nee we 
know that the I’’s exist which satisfy these equations, we get 

Matrix Conpition II (General case). The augmented matrices E, = 
|| Bexibn3 Pda lane || are of rank r, in every V(P) for each choice of (apt). 

As we have stated them, the matrix conditions I involve Z,,), and Ea, ,,\0,4, 
only. On the contrary, conditions II involve their first derivatives, and T'i-}., 
and thus are not expressed as conditions on the fundamental forms. However, 
T;.} is given by (6.5) in terms of Z4,_,),-, and Th-3,. By repeating this process 
we finally can eliminate all T’s and may regard the matrix conditions II as involving 
only the fundamental forms. 

In a similar manner it follows that the Gauss equations (4.13) and (4.14) 
can be expressed in terms of the coefficients of the fundamental forms and their 
derivatives only. It is conceivable that in eliminating the T'’s by means of (6.5) 
the arbitrary 6’s might enter into the resulting expressions. Direct examination 
of the formulas involved, however, shows that this is not the case. We have 
thus proved 
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TueoreM II. The matrix conditions I and II and the equations (4.13) and 
(4.14) are conditions on the coefficients of the fundamental forms which are necessar- 
ily satisfied if the forms define an imbedding of a Riemann space. 

We now turn to the converse problem and determine conditions which are 
sufficient for these forms to define such an imbedding. We assume at the 
outset that the E,,», satisfy (1) of Theorem I and that they also satisfy the 
conditions stated in Theorem II. We define a set of I’s in each V(P) by the 
following recursive process. For h = 1, (6.2) define T's, uniquely as functions 
of class C"™’. These actually satisfy all equations in (4.11)-(4.17) for h = 1, 
for the only equations which occur in this case are (4.15) and (4.17), and these 
are indeed satisfied identically because of (6.2). 

For h = 2, we define It’s, by means of (6.3). This is possible since the right 
hand sides contain only the given E,s),,, their derivatives, and a previously 
determined set of T'3,. Since the matrix Eis of rank rz, a solution of (6.3) exists 
and is determined to within an additive function 62%, which is a solution of (4.5). 
These solutions are defined as functions of x in each V(P), and the additive 
functions can be so chosen that they are of class C”*. In a similar fashion 
I's,s are defined as functions of class Cc” in U by equations (6.1). It remains to 
investigate whether the I’’s so defined actually satisfy (4.11)-(4.17) when h = 2. 
If we substitute for T%, in (4.15) and then use (4.13), we obtain 


Bia be.y + Bysse.a + Bay be. = 0. 


But this is Bianchi’s identity, and so our I’’s satisfy (4.15). Similarly the result 
of substituting for I's}; in (4.17) vanishes because of (4.13) and the identity (5.9). 
Since (4.16) are satisfied by the very definition of Ty,;, and since (4.11) and 
(4.12) are consequences of the other equations, it follows that if Z.;, and EB. ),s 
satisfy the Gauss equations and if E; is of rank re, it is possible to find P'f,, and 
V't’se Which satisfy (4.11)-(4.17) for h = 2. 

For a general value of h we solve (6.1) and (6.5) for ['}}7' and I?t, in terms of 
the coefficients of the fundamental forms and previously determined I'’s. And 
as for the case h = 2, it can be shown that these new I’s satisfy (4.11)-(4.17) 
as a consequence of the identities of §5. Hence we have 

THEOREM II]. Let there be given a set of tensors Ea, ...a,\8,--.8, (kh = 1, +--+ , m) 
which are defined as functions of class C"™~**' in U, where E, is positive definite in U, 
and E, (h = 2,---,m) is positive semi-definite of constant rank r, in U and 
ro +--> +r, = p. Then quantities T3)...g,5 and T5)..s,3°' can be found 
which will satisfy the equations (4.11)-(4.17) in every V(P) C U if and only 
if the Gauss equations are satisfied in U, the m-th curvature tensor vanishes in 
U', and the matrices E,, and E;, are each of rank rxin U. The V$)..3%% so defined 


Bar 
are of class C"™ and the T$}..42*~' are of class C”~"** respectively. 


7. Solution of the Frenet equations in a restricted neighborhood. We now 
suppose that a set of E’s are given which satisfy the conditions of the preceding 
theorem. In a definite V(P) we define the I'’s by the method just described. 
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Consider further a spherical neighborhood W(P) covered by a single coérdinate 
system and such that V(P) > W(P) > P, i.e., one such that its map in the 
arithmetic space is the interior of a sphere. We now seek to integrate the mixed 
system composed of the Frenet equations (4.1) and the algebraic equations (3.2) 
and (3.3) in the neighborhood W(P). 

For convenience let us denote (4.1) by 





; ti=1,---,n+p; 
0 i ri . aY; ) ri ‘ 
(7.1) apa (') = Vast 5 oS = ear Vi h=1,-++,m; ) 
K = Qy-1,0,, As, 


Let the codrdinates of P be xo, and assign the values (y')o ; (Y,)o to the unknowns 
at P such that (3.2) and (3.3) are satisfied when x = 2. It is clear that this 
choice of initial values is always possible, for the matrix E is by hypothesis a 
symmetric, positive semi-definite matrix of rank n + p. Therefore a canonical 
representation of the form (3.2) and (3.3) is possible at P.° 

Moreover, any two possible sets of initial values (¥i,)o are. connected by the 
relation 


(7.2) (Yi,)o = ai(¥2)o ; 


where the a’s are an orthogonal matrix of constants. The values of (y')) are 
completely arbitrary. 

Picking out one such set of initial values we now seek to find the corresponding 
values of the unknowns at another point Q of codérdinates x, C W(P). To do 
this we join P and Q with an arbitrary curve C, lying in W(P), defined by 
x“ = f*(t), where f*(0) = xf; f*(1) = af and f*(é) are single valued and of class 
C’for0 < ¢t <1. There always exists such a curve, for in particular we may 
take f*(t) = 2¢ + (af — 2¢)t. Now integrate 


dy" _ Y‘ dz* | dc 
= Y% — 


along C for the above set of initial values. The quantities y‘(t), Yi,(é) thus 
defined satisfy (3.2) and (3.3) for ¢ = 0 by hypothesis, and indeed they satisfy 
(3.2) and (3.3) all along C. For let 


(7.4) [ay “ssa, Bieee Bi] — a Sen ~_ Tin +> <tahiy-ooiys 


(7.3) 





' ; dx’ 
= eaurle()] Yi 5 


(7.5) [oy +++ an Br-++ Be) = Yi,...05 Yo, 4p (h ¥ k). 
Then (3.2) and (3.3) may be written 
(7.6) [ar --- an Pr--- Bi] = 0 (h,l = 1, ---,m). 


Differentiating (7.4) and (7.5) along C and then applying (7.4) and (7.5) to 
the result, we find that the derivatives of the bracket expressions are linear 
combinations of the bracket expressions themselves, the coefficients involving 


6 Cf. Duschek-Mayer, Lehrbuch der Differentialgeometrie, II, p. 25. 
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the I’s and dx*/dt, plus terms which vanish because of (4.16) and (4.17). But 
this shows at once that (7.6) is satisfied all along C. 

TurorEeM IV. The solutions Y},(t) of (7.3) whose initial values satisfy (3.2) 
and (3.3) for x = 2 satisfy (3.2) and (3.3) for all points of C. 

We have assumed that the conditions (4.11)-(4.17) are satisfied in W(P). 
Moreover, we know from §3 that if (4.11)-(4.15) hold, then the integrability 
conditions (4.6)-(4.10) are satisfied provided that (3.2) and (3.3) are true. 
Hence we have 

TuHeorEM V. The integrability conditions (4.6)-(4.10) of the Frenet equations 
are satisfied along the curve C by the solutions of (7.3), whose initial values satisfy 
(3.2) and (3.3) for x = x. 

In order to prove that the values of the y' and Yi, thus defined at Q are genuine 
functions of the z’s, it is necessary to show that these values are independent 
of the curve C along which we have integrated. Join P and Q with some other 
curve C, of class C’ lying in W(P) whose equation is 2* = f7(t), where ff have 
the same properties as f*.’. Since W(P) is a spherical neighborhood, C and C, 
can be imbedded in a one-parameter family of curves x* = G*(t, p), joining 
P and Q and lying in W(P). The G*(t, p) are assumed to possess the following 
continuous derivatives 


aG. aG aG  aG 


at’ ap’ atap dapat 
for0 < t S landO S p S 1 and to be such that G*(¢, 0) = f*(é) and G*(t, 1) = 
ST(); G*(, p) = x ; and G*(1, p) = zf. In particular, we may put G*(t, p) = 
£O + aALKO — fl. 


Consider the equations 
dy’ aYi, 


ri ae 
(7.7) a = Yelp, 01 =; ie 


- aG’ 
ae 





= Pony [G(p, i) Y. 


along any curve of the family, i.e., for a particular value of p. If we take for the 
values of the unknown at P (corresponding to t = 0) the quantities which 
served as initial values for our integration of (7.3), the equations (7.7) define 
a set of functions y‘(t, p), Yi,(¢, p) for all values of t and p in the specified ranges. 
Because of Theorem IV they also satisfy 


(7.8) Eo, (Gt, p)] = Yal@t, p)l¥iGt, p)I.- 


For ¢t = 1 we have the values of y‘(1, p) and Yi,(1, p) at Q as defined by the 
curve of the family of parameter p. We now show that these values are actually 
independent of p. 


7 The following treatment is based on that given by T. Y. Thomas, Systems of total 
differential equations defined over simply connected domains, Annals of Math., vol. 35 (1934), 
pp. 730-734. Thomas’ treatment, however, applies only to completely integrable systems 
which do not involve additional algebraic equations. 
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Consider the equations 








= ay" 2 oY; k = 
: = Y‘i(G —* = ¢,,(G 
(7.9) ry ) gt ) = at Ga, (G) Yi.(G@) 
( un 
ov YG) o® +e, 
(7.10) a ‘ oe 
OY a, t, p ‘ 
t ap = Cary (G)Y i(@)? -—“s + Cap, ’ 


where o} and o}, are defined by the last equations. If we differentiate (7.9) 
with respect to p, and (7.10) with respect to ¢, and equate the right hand sides 
of the resulting equations, we obtain 


do, OG oy 1 8G AG’. 


wel a Mba Sion es 
rte a ies ie 
(7.11) @ ag 
0 a i 7 - vi 7] rs] 
O0a _ gigk . + G53 k 


at “at ap’ 


where @),3 and 5, ,; are the bracketed expressions on the left sides of (4.6)-(4.10). 
From Theorem V we see that 

$),;Yi = 0, $%,,5Yi =0 
for all allowable ¢t and p. 

Now for t = 0, we have y’, Yi,, and G*“(0, p) by hypothesis independent of p; 
hence from (7.10) it follows that o) = 0 and oj, = 0 fort = 0. And so from 
(7.11) we have that o} = Oand a1, = 0 for all allowable values of ¢t and p, and 
in particular for = 1. We have also by hypothesis that G(1, p) are independent 
of p, and hence from (7.10) it follows that the solutions of (7.7) at ¢ = 1 are 
independent of p. 

We therefore define y'(x), Yi,(x) as.functions of x in W(P) whose values at any 
point Q © W(P) are obtained from a given set of initial values satisfying (3.2) and 
(3.3) at P by integrating along any curve of class C’ joining P and Q and lying in 
W(P). These functions will satisfy (3.2) and (3.3) in W(P). 

The proof that these functions are unique and that they are actually solutions 
of the differential equations is given by Thomas.’ It is also shown there that if 
Q C W(P) and if initial values are chosen satisfying (3.2) and (3.3) at Q, there 
exists a unique solution of the mixed system in W(P) having the given initial 
values at Q. To find this solution, it is only necessary to integrate along a curve 
from Q to P, thus determining values of the functions at P. With these as 
initial values the system can be integrated in W(P), and it is shown that the 
resulting solution has the required initial values at Q. We shall need this 
property in the next section. 

TuHeorEM VI. Let (1) there be given a set of functions Eay.--ay|8;---6, (hk = 
1, --+ , m) of class C” "in a spherical neighborhood W(P) such that E, is positive 
definite in W(P); E,, is positive semi-definite of constant rank r, (h = 2, --- , m) 
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in W(P) and rz + --- +r,» = p; (2) the matrices E;, and E, be of rank r, in W(P); 
(3) the Ba, ..-a,\3,--.3, and their derivatives satisfy the Gauss equations in W(P); (4) 
the m-th curvature tensor vanish in W(P). 

Then and only then do the Frenet equations (4.1) and their associated algebraic 
equations (3.2) and (3.3) have a unique solution y'(x), Y4,...a,(2) for all x C 
W(P) for each set of initial values (y')o and (Y%,,...«,)o which satisfy (3.2) and 
(3.3) at any point Q C W(P). The Riemann space defined by E.;3 in W(P) is 
thus imbedded in a Euclidean E,,, . . 


8. Solution of the Frenet equations in an open simply connected domain. 
Next we consider an open simply connected domain U of a coérdinate manifold. 
Suppose that functions F,,),, (2 = 1, --- ,m) satisfying the hypothesis of the 
last theorem are defined throughout U. Then each point P C U is contained 
in at least one spherical neighborhood W(P) C U within which the I’’s may be 
defined as in §6. If two such neighborhoods W(P) and W(P’) have points in 
common, from §4 we see that the ([)p defined in W(P) and the (T)» defined in 
W(P’) satisfy the equations 


Eayiosl(Ton,.)p a (ret, e] = 0, 
Eay-si—l(Tersie — (Petside-] = 0 


in W(P) N W(P’). We shall now show that the conclusions of the theorem of 
§7 can be extended to the entire neighborhood U’. 

Pick out any point P C U of coérdinates z» and assign initial values (y')o , 
(Yi,)o to the unknowns at P which satisfy (3.2) and (3.3) at P. In order to 
determine the corresponding value at another point Q C U, join P and Q with a 
simple curve C of class C’. This is possible, since U is connected. Because 
of the Heine-Borel Theorem, C can be covered by a finite set of the spherical 
neighborhoods W,,---,W;, where W, = W(P) and W; = W(Q). Let 
Vi, +--+, Vy be points on C such that V, CW, N Wou Gg = 1,---,.f — DD 
and V; = Q. Integrating along C from P to V,, we obtain values (y'); and 
(Y3,), at V; corresponding to the given initial values. Then taking these values 
at V, as initial values integrate in W2 from V,; to V2 and continue this process 
until finally by integration from V;_, to Q values (y')g and (Yi,)¢ are defined 
at Q. 

Suppose next that P and Q are joined by another simple curve of class C’, 
say C,, which passes through the same set of neighborhoods W,, --- , W; in their 
natural order. Select points X,, --- , X; on C, such that X, CW, N Wy. 
(g = 1,---,f — 1) and X; = Q. Now by the above process define values 
(y')g and (Yi,)o at Q corresponding to the curve C, for the given initial values 
at P. 

Let V, and X, (g = 1, --- ,f — 1) be joined by some curve of class C; , say 
S,, lying in W, N W,.:. Integrating (7.3) along S, from V, to X, for the 
values of the I’s, ((),, defined in W,, and for initial values satisfying (3.2) 


(8.1) 
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and (3.3), we obtain a set of y'(t) and Yi,(t) for each point of S,. Now (7.3) 
are actually equations of the form 


dy yi dx* 

° ee i 
(8.2) ‘ 
dx” 
dt © 


Now substitute the y‘(t) and Yi,(0) obtained by the above integration along S, 
into the equations (8.2) in which the (1), are replaced by the (T),.: defined in 
W,4: , and subtract the resulting — from (8.2). We obtain 


dx“ ; dr” 
bat “dt =a 


Because of (8.1) and the fact that (3.2) and (3.3) are satisfied along S,, we see 
that (8.3) are satisfied identically in t. Hence integration along any S, gives 
the same result, whether S, is regarded as a curve of W, or as a curve of Wo: . 

Next we observe that the values of the unknowns at X, are the same whether 
the integration has proceeded from P to X, along C; or from P to V; along C 
and then along S, from V; to X; ; for all these curves lie in W(P). We indicate 
this symbolically by 


(8.4) PX, = PV,X). 


io ’ 
a (Yen) = wre: 7... 1 + a2 aj / > 4 ba + ea 


(63) 1s), — Tas ¥ + [(Ter)o — (Teevosl ¥3 = 0. 


Now suppose, in order to establish an induction, that the values at X,_; are the 
same whether we have proceeded along C, to X,_; or along C to V,_; and then 
along S,-; from V,_, to X,_,. That is, we assume that 


(8.5) PX,-1 = PV 5-1 X5-1 ° 


Then the values at X, as found by proceeding along C,; from P to X, are the same 
as those obtained from considering the curve C from P to V, and then the curve S, 
from V,to X,. That is, 


(8.6) PX, = PV,X,. 


For to values at X,_; given by integration along C,; from P to X,_,, the cor- 
responding values at X, are the same for the integration along X,_,X, as for the 
integration along X,_,V,1V,X, , since all curves involved lie in W, ; i.e., 


(8.7) Xo-1X¢ = Xy-1 Vou VX, . 


Combining (8.7) and (8.5), we establish (8.6). Since the hypothesis of the 
induction holds for g = 1 because of (8.4), the relation (8.6) holds for any g, 
and in particular for g = f, i.e., at Q. Therefore the values (y')g and (Yi,)e 
determined by the curve C covered by W,, --- , Wy are the same as those determined 
by any other simple curve of class C' lying in W,, --- , Wy and passing through 
them in their natural order. 








ee aS ~~ 
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Finally, consider any simple curve C; of class C’, lying in U, joining P and Q, 
and distinct from C. As above C,; may be covered with a finite number of 
neighborhoods and gives rise to a set of values (y')g and (Y%,)e corresponding to 
the given set of values at P. Suppose that C and C: are imbedded in a one- 
parameter family of curves joining P to Q and such that for the parameter value 
p = 0 we have the curve C and for p = 1 we have C,. Assume also that the co- 
érdinates x“ on curves of the family are continuous functions of p for 0 S p < 1. 
Hence if some member of the family, say C;, of parameter p is covered by a 
finite set of neighborhoods, Y,,--- , Y;, there exists a 6 > O such that for 
|p — p| < 6 all curves of parameter p lie in Y;, --- , Y; and pass through them 
in their natural order. Because of this property and the last italicized state- 
ment, the [y‘(p)]e and [Yi,(p)]e are differentiable with respect to p for 0 < p < 1, 
and in fact the derivatives are zero for each value of p. Hence the unknowns 
are uniquely determined at Q for given initial values satisfying (3.2) and (3.3) 
at P by integrating along any simple curve of class C’ joining P to Q and lying 
in U. 

THEOREM VII. Theorem VI remains true if the spherical neighborhood is re- 
placed by an open simply connected domain. 

From (7.2) we have the fact that any two sets of initial values of Y, which 
satisfy (3.2) and (3.3) must have the relation 








(8.8) (Ya)o = a} (Y4)o, 
where || a} || is orthogonal. Hence the corresponding y’s satisfy 
(8.9) y = ayy" +0'"7 


where b* are constants determined by the given initial values of the y‘ and y"’. 
Therefore, the above imbedding is determined to within a motion in the Euclidean 
space. 


PRINCETON UNIVERSITY. 








REMARK ON THE THEOREM OF GREEN 


By S. BocHNER 


The theorem of Green is an identity between an integral over a compact 
region R of an orientable locally Euclidean separable space S of class two' whose 
dimension will be denoted by n, and an integral over the boundary B of R, in 
case B is formed by “hypersurfaces”: 


(1) [ av Adv = [xed 
R B 


If R is closed, that is, compact and equal to S, thus having “no boundary”, 
the integral over the boundary is to be put equal to zero: 


(2) I div \dv = 0. 
s 


If R is contained in one coérdinate neighborhood, the proof of (1) is compara- 
tively simple, provided the boundary B is sufficiently smooth with respect to 
the coérdinate system.” But the passage from the local case to a domain R 
in the large is rather laborious. It requires a cellular subdivision of R into 
sufficiently small subregions whose boundary is sufficiently smooth, an applics- 
tion of the local theorem to each subregion, and finally a justification of the 
mutual cancellation of the boundary terms arising from the artificial cellular 
partitions. Now this procedure is much too complicated and heavy in the 
case of formula (2) or in case of formula 


(3) [ div \dv = 0 (x* = 0 on B). 
. R 


We want to show that these two formulas can be deduced in a much simpler 
fashion, even avoiding any complication that might be inherent to the local 
theorem itself. In particular we shall eliminate from the proof the concept of 
[(n — 1)-dimensional] volume on the boundary B. 

Our space S being of class two, we can consider on it tensors of class one 
and tensor densities of class one. In particular, we assume the existence of a 
positive (non-vanishing) scalar density which, as in the special case of Riemann 
spaces, will be denoted by +/g; in going over from codrdinates (x, --- , 2x) 
to codrdinates (y;, --- , yn), the quantity ~/g is to be multiplied by the jacobian 


Received December 19, 1936. 

1 That is, a space allowing local coérdinate transformations with continuous first and 
second partial derivatives and a positive (non-vanishing) Jacobian. 

2? A. Duschek and W. Mayer, Differentialgeometrie, 1930, vol. II, p. 237 
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3 - ‘ > . 
| éx/dy|.° With such a scalar density we can form an invariant volume ele- 
ment 


(4) dv = dv, = Vg dx +--+ dtq. 


It gives rise to a regular Lebesgue measure and a Lebesgue integral as in the 
Euclidean case V9 = 1; the only material difference being that in the case of 
every closed space S the total measure of S is finite. 

If \‘ is a contravariant vector, the quantity 


i .. 1 a(vgr') 
(5) div \ = r a 


is an absolute scalar. This follows easily from the fact that the (n — 1)-dimen- 
sional minors A;; (with appropriate signs) of the matrix 





| Or; || 

(6) Bf amex Uf 
|| By; || 

satisfy the relation 

(7) he Le Gok >. oe 
dy; 


Hence we can form the integral of div \ over any bounded measurable set A. 
If A is contained in a coérdinate neighborhood, then 


(8) [ aiv Adv = I (V9 ') dz, +--+ dx,. 
A A Ox; 


Suppose now that A is a rectangle a, < z, < b,, vy = 1,---, nm, and that X*° 
vanishes on the boundary of A. In this case, for each 7, 


[mae = 0, 


i Ox; 


and therefore 
A 


Also, the integral of div \ is 0 over any open set on which \‘ vanishes. Hence 
we obtain the first result that (2) is true if \* vanishes outside some “rectangle” 
A. If S is closed, and hence bicompact, we can cover it by a finite number of 
neighborhoods U,,---,U,, whose closures are contained in ‘rectangles’ 
A,, +--+: ,Am respectively. Corresponding to each yu, uw = 1, --- ,m, we can 
easily find a neighborhood V, between U, and A, and a non-negative scalar 
function ¢, of class one in A, which is 2 1 in U, and 0 outside V,. Completing 
¢, by values 0 outside A, we have, throughout S, gi: + --- +¢mn 21. Hence the 
functions of class one 


QO. Veblen, Invariants of Quadratic Differential Forms, 1927, pp. 19-25. 
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(10) v a 
ae oe 

have the following properties: each vanishes outside some rectangle and their 
sum is 1. For fixed » we multiply the vector \’ by the scalar y,._ The resulting 
vector X{,) vanishes outside a rectangle, and hence satisfies (2). But div \ = 
> div yw , and this completes the proof of (2). 
w= . 

Formula (3) can be proved in a similar way provided X* is strictly regular up 
to the boundary, this expression meaning that, if completed by values 0 outside 
R, it is a vector of class one throughout S, so that (3) may be written 


(11) I div Adv = 0 (x' = 0 outside R + B). 


In fact, we can construct non-negative functions of class one g,-°-: , Gm, 
each vanishing outside some rectangle and whose sum is 2 1 for all points of 
R + B. The functions (10) will be of class one in some neighborhood U of 
R + B, although perhaps not throughout S. But \* vanishes outside R + B, 
and therefore the vectors dj, are again of class one, and we can again prove (3) 
from 


(12) [ div ado = >> | div ry de. 
w=1 JS 

We can make further statements if S is a Riemann space with a positive- 
definite fundamental tensor g;; having the following property. Corresponding 
to any point x° of S there exists a neighborhood U(zx°) such that for any points x, y 
of U(x") the function Q(x, y), which is the square of the geodesic distance between 
x and y, has continuous first partial derivatives in (%1, +++, 2n3 Yrs *** y Ynd 
and that, for some constant C = C(2»), 


(13) || grad, 2 ||? = g(x ms 


< CX(z, y). 

The neighborhood U(X°) entering our condition can be assumed to be a 
sphere with center at 2°: Q(z, x) < p,p > 0. If x° transverses a set A whose 
closure is compact, then by the Heine-Borel theorem, this radius p and the 
constant C(z°) may be chosen uniformly with respect to A: p = p(A), C = 
C(A). If 0 < r < mm = $p(A), then A, will denote the open set consisting of 
all points z for which Q(z, z°) < r for some 2° C A. Let g(t) be any non- 
negative function of the real variable ¢ which has a continuous first derivative 
in0 St S 3, is positive forO < t < }andOforl St <3. If 3r < 7, the 
expression 


vo ror= [SPY m/f, oP) 
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defines a non-negative function of class one in A3, which has the value 1 on A 
and the value 0 outside A2,. Completing P,(x) by the values 0 outside As, , 
we obtain a non-negative function of class one throughout S which is < 1 
everywhere, has the value 1 on A and the value 0 outside A2,. Owing to the 
nature of our function g(t), the denominator is bounded from below by a con- 
stant multiple of r" uniformly for all points z in A, , and both the numerator 
and denominator are bounded from above by a constant multiple of r”, for the 
given function g(t), and, likewise, if this function is replaced by ¢’(t). Forming 
now the partial derivatives of P,(z) and using (13), we find that 

(15) || grad, P,(x) || < &, uniformly for z C A,. 

We are now in a position to drop completely in formula (3) the requirement 
that \‘ be strictly regular up to the boundary and to interpret the boundary 
condition “A‘ = 0 on B” as a very light condition requiring that \"(z) have limit 
values 0 as x approaches the boundary B from within the region A. 

Tueorem. If R is an open region whose closure R + B is compact and \* 
is a contravariant vector of class one in R, then 


(16) [ aiv Adv = 0 
holds, provided 

(17) i |divA| dv < « 
and 

(18) lim 2 Vaa¥N do = 0. 


Proof. Condition (17) simply states that our integral shall also be absolutely 
convergent and condition (18) requires that the “length” of \‘(x) shall “in 
average” tend to zero as x tends to the boundary, in the sense that the integral 
of || \‘(x) || over the set of points having a distance less than r from the boundary 
shall, as r tends to 0, be small in relation to the quantity r. 

Now we shall give the proof itself. We consider the region R, the boundary 
B, and the exterior R. In B + R we complete * by values 0, and we multiply 
\‘(x) by the function 


(19) Q(x) = 1 — P,(z), 


the function P,(x) being formed with respect to the compact set B. It is easily 
seen that \(,, = Q,-’ is strictly regular up to the boundary and vanishes on B. 
Hence (16) holds for \(,,. All we have to prove now is the limit relation 


(20) [ div (A — Aq )dv = [ div (P,A)dv — 0 asr— 0. 
R R* Bor 
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But this follows from 
div (P,-\) = P,-div \ + grad P,-d 


in connection with (15), (17), (18) and | P,(x) | < 1. 

In the theorem we have just proved, the boundary may in part or totally 
consist of points which are limit points of R and not of R; in other words, the 
theorem also takes care of the case in which the vector has singularities along 
lower dimensional manifolds. In particular it can be concluded that in Green’s 
theorems (1), (2), (3) the vector \‘ may cease to be of class one along sufficiently 
smooth manifolds of dimension < n — 2 as long as its length stays bounded in 
the neighborhood of these manifolds. 

We can also make a statement regarding equation (1) itself. Let S be closed, 
and suppose that on some given boundary B it is possible to define an area 
element dw and a vector #; such that (1) holds for every vector \* which is de- 
fined and of class one in a neighborhood of R + B. Then the same is true for 
the exterior R of R + B, which means that 


(21) [ diva ae = [x0 de 


for every function \* which is defined and of class one in a neighborhood of 
R + B, provided that the (n-dimensional) volume of the set B is 0. In fact, 
(21) follows immediately from (1) and (2) whenever \* is defined and of class 
one throughout S. But the values of the two integrals in (21) depend only 
on the values of \‘ on R + B. Hence (21) will hold for a vector \' which, with- 
out altering its values on R + B, can be modified into a vector which is of class 
one throughout S. Now if \° is of class one in a neighborhood of R + B, this 
modification can be obtained by multiplying \‘ with a function P,(z) that be- 
longs to the compact set R + B and by putting it 0 in other points. 


PRINCETON UNIVERSITY. 











ANALYTIC MAPPING OF COMPACT RIEMANN SPACES INTO 
EUCLIDEAN SPACE 


By S. BocHNER 


Recently H. Whitney’ has proved that an n-dimensional separable codrdinate 
space S of class C, (1 S ¢S ~) can be mapped topologically onto the Euclidean 
Een+; in such a manner that the mapping functions 


(1) t, = t, (m1, --- , Zn) (vy = 1,---,2n + 1) 


belong to class C, on S and have a Jacobian of rank n throughout S; the quanti- 
ties 7,, +--+ , 2, in (1) are local coérdinates on S varying with the neighborhood. 

It is not known whether for an analytic space S the mapping functions (1) 
can be chosen analytic. It is the purpose of the present paper to point out that 
they can be so chosen provided S is compact and has an analytic Riemann 
metric. 

The line of reasoning is very simple. With the fundamental tensor g;;(z), 
we form the Laplacian 


1 @ i 09 
2 he = = —i ae’ —) = 6” ¢.4. 
(2) ¢ LA (os ae ) 9° ii 
In the Hilbert space H of all square integrable functions on S, the Laplacian 
is essentially the inverse of a completely continuous operator. Therefore the 
solutions ¢ of the equation 


(3) Ag = e 


form a complete basis in H. But the solutions of (3) are analytic if the co- 
efficients g;; are so. Hence every function ¢(z) on S is the limit in square mean 
of analytic functions. For differentiable functions on S we shall prove more; 
if t(x) belongs to C,,, then corresponding to any « > 0 there exists an analytic 
function ¥‘(x) such that the function t(r) — Y‘(x) and its gradient differ by less 
than ¢ throughout S.? We now apply this approximation to the functions (1). 
It is easy to see that, for ¢ sufficiently small, the approximating transformation 


(4) t, = Wilt, +++, In 


will have the same mapping properties as the original transformation (1), which 
proves our assertion. 


Received February 16, 1937. 

1 Differentiable Manifolds, Annals of Math., vol. 37 (1936), pp. 645-680; p. 654, Theorem 1. 

2 In order to prove this conclusion it would be sufficient to assume that ¢(x) belongs to 
C;,. But the proof would become more elaborate. 
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The Laplacian Ay on a compact space S was treated by Hilbert (for n = 2)’ 
and by G. Giraud.‘ In order to avoid too many references to these and other 
papers, which would have to be supplemented in minor points in any case, we 
shall reproduce a number of known details in the first half of this paper. 

Finally, in the last part we shall have occasion to generalize the properties 
of the Laplacian from scalar functions to tensors of arbitrary rank. 


Part I. Definitions 


1. We shall consider a space S which is a compact orientable coérdinate space 
of an arbitrary dimension n with an analytic positive-definite fundamental 
tensor g;;(x). In order to avoid a known trivial complication in writing, we 
shall exclude n = 2, and thus assume n 2 3. 

The family of analytic functions and tensors on S, or on a specified open set 
in S will be denoted by C,, and C,,0 < q S ~, will denote the family of func- 
tions or tensors of class q. 


2. A point on S will be denoted by zg, y, z, %, &, n, --- , its coérdinates by a 
corresponding subscript. The invariant volume element on S gives rise to a 
theory of Lebesgue measure and Lebesgue integration with the customary 
properties, except for the fact that the total measure of S is finite. The measure 
element will be denoted by dv, or more specifically, by dv, , dv, , dv;, etc. Thus 


[se or [@ dv, 


will denote the integral of the function f over the set A. In case A = S, we 


shall simply write 
[sa or [ 1) aw, 


3. The class of functions f(x) which are bounded and measurable on S, or 
on a specified subset of S, will be denoted by B. Furthermore we shall con- 
sider, on the total space S, the Hilbert space H of all (real) measurable functions 
of integrable square, with the inner product 


0) = | fade 


We shall also use the symbol || f ||? for (f, f). It follows easily, as in the case 
of a Euclidean region S, that every subspace C, of H is dense in H. It is our 
main objective to prove that C, is likewise dense in H. 


3D. Hilbert, Grundziige einer allgemeinen Theorie der linearen Integralgleichungen, 1912, 
chapter 18. 

4 Probleémes miztes et problémes sur des variétés closes, etc., Annales de la Société Polonaise 
de Mathématique, vol. 12 (1933), pp. 35-54; p. 43. 
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4. Functions of two variables, f(z, y), f(z, £), etc., will be defined either on 
the product space S° = S X S, or on a specified part of it. It is clear what 
will be meant by the symbols c.f. 


5. Corresponding to any point 2° there exists a neighborhood U = U(z’) 
such that any two points z, y C U have a unique geodesic distance R = R(z, y). 
The square of the geodesic distance will be denoted by 2 = Q(z, y). If U is 
sufficiently small, then Q(z, y) C CZ. Owing to the bicompactness of S and S” 
we can find a number pp > 0 such that Q(z, y) C C3 if R(x, y) < po. 

The local analyticity of Q(z, y) becomes obvious if Q(z, y) is expressed in 
normal coérdinates. The normal coérdinates z; originating at the point 2; 
are analytic functions 


2; = 2i(x, y) = yi — 2; + (higher terms in z and y) 
defined for z, y C U(a’). They are characterized by the relations 
(5) Gis(z)z; = gii(x)z;, 


in which §;;(z) is the fundamental tensor expressed in the z-coérdinates originat- 
ing at the variable point z, with the initial values 9;;(x) = g;;(x); the tensor 
gi;(z) being given in a fixed coérdinate system of which z and y are arbitrary 
points. Important properties of the functions z;(z, y) are 


02; Oz; ae 
(6) az; — bij, ral ifz = 0, 
(7) a9ii(2) = 0, if z= 0, 
Oz 
(8) Q(x, y) = gislz)ecz; = gilz)ec2z;.° 


6. The family C%,,, 0 < r < n, consists of functions Q(z, y) with the following 
properties. Q(z, y) is not defined for z = y. Everywhere else on S$’ it belongs 
to class C?. Inside some belt R(x, y) < p, Q(z, y) and each of its derivatives 
is representable as a quotient 





A(z, y) . 
(9) RG, y 


also, within this belt, Q(z, y) has the order of magnitude O[R(z, y)], and each 
derivative of order go(< q) has the order of magnitude O[R(zx, y)" “*). 

We shall also consider a class of functions B?, 0 S r <n. The function 
Q(z, y) belongs to B? if it is measurable on S’, bounded outside some belt R(x, y) 


* For the existence of normal coérdinates with these properties it is not necessary to 
assume that our space is analytic. They exist if S is a space of class p 2 2, and they are 
themselves of class p — 2. Checking all steps of our argument, we could easily see that 
our Theorem I holds if p = 6, but normal codrdinates can be avoided altogether and the 
class of the space reduced by several units. Compare G. Giraud, loc. cit. 
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< p, and O[R(z, y)’] within this belt. For such functions we shall consider 
the process of convolution. For instance 


(10) Q’+Q” = [ee £)Q"'(E, y)dog. 


7. The symbol #,(¢) will denote any function in 0 S$ t < © with the following 
properties. The function has derivatives of all orders, is non-negative through- 
out, has the value 1 in 0 S ¢ S 4p, values between 0 and 1 in 3p < t S p, and 
the value 0in p St < @. 


8. The symbol P(x, y) will denote any fixed function with the following 
properties: 
(11) P(x, y) = Ply, 2), 
(12) P(x, y) © Cane, 


and there exists a p > 0 such that 


(13) P(z,y) = if R(x, y) <p 


= Y 
R(a, y)"*’ 
for some prescribed constant y. We shall prescribe 


_ TG@n- 1) 


(14) — 








The existence of such a function is trivial. For example, the function 


#,([R(z, y)] 


(15) P(s, 9) = ¥ 
mn TRG, 
will do. 
Any function P(x, y) with these properties has the further important property 
(16) A. P(x, y) C Ci.n-2- 


This property states essentially that the Laplacian, if applied to our function 
P(x, y), does not raise the exponent of its singularity on the diagonal x = y; 
whereas, as a rule, if applied to a function from C’%,,,, it increases the exponent 
r by 2. The proof of (16) follows easily if R(x, y) is expressed in normal co- 
ordinates issuing from x; in such codrdinates z; we have® 


i _ mM a log g(z) =e 
A.R = (1 *) _— Zi R ’ 


and, by (7), the factor of R~” on the right is O(R’). 





*W. Feller, Lésungen der linearen partiellen Differentialgleichungen zweiter Ordnung 
vom elliptischen Typus, Mathematische Annalen, vol. 102 (1930), pp. 633-649; p. 639, foot- 
note. 
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It will be important for us to consider not the Laplacian Ag itself, but the 
operator 


(17) A‘g = Ag — eg 
for some positive constant c. The constant ¢ being fixed, we shall write 
(18) K(x, y) = A: P(z, y), K(z, y) = K(y, z) = Ay Pty, 2). 


These functions again have the property (16). 


9. The distance R(x, y) is comparable in size to the Euclidean distance 
(19) E(x, y) = (ys — 2}. 


Hence, the integral 


(20) v(y) = / ¢(x)Q(x, y) dv: 


exists for every y if g(x) C B, Q(z, y) C Co, r <n. For fixed Q(z, y) and 
variable g(x) the integral operator (20) will be written 


(21) v = Qe. 


We shall be mainly interested in the operators Pg and Kg resulting from the 
functions P(x, y), K(x, y) that were introduced before. 


Part II. Preliminaries 


1. We shall state several properties of the expression (20), all of which are 
trivially true if Q(x, y) happens to vanish inside some belt R(x, y) < po. 
Lemma 1. Jf g(x) C B, Q(z, y) CCo,-, r <n, then the function (20) belongs 
to Co*; more precisely, the modulus of uniform continuity of ¥(y) depends only on 
M = Lub. | e(x) |. 
zCs 


Proof. With the function #,(¢) of Part I, §7, we form the expression ¥,(y) = 
S e(x)Q(z, y)[1 — & {R(zx, y)}]dv.. For o fixed, our lemma holds for y,(y) in the 
place of ¥(y). But ¥.(y) tends to ¥(y) as o — 0, uniformly in y and M. 

Lemma 2. If g(x) C B, Q(z, y) C Ci,., r < n — », then the function (20) 
belongs to C,, again uniformly in M, and the partial derivatives of ¥(y) may be 
obtained by partial differentiation under the integral. For example, 


ay / aQ 
22 — = zx) — dv. 
(22) ay1 o( ) on 
Proof. By using induction, it is sufficient to consider the case y = 1. Since 
aQ C Corus, 7 +1 < vn, the integral 
OY 


(23) ib / ol) 28 
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represents a function in Cy). We restrict ourselves to a coérdinate neighborhood 
(yi, °**, Yn) and integrate the function g(y) with respect to the variable y 
between the fixed limits a and 8. Since the integrand on the right side is a 
Lebesgue integrable function on S° we can exchange, on the right side, the 
two processes of integration for almost all (yz, --- , y,), and hence 


B 
I gly, °** , Ynddyr = WB, yo, --* , Yn) — Wla, yo, +>, Yn) 


for almost all (ye, --- , Yn). But both sides of the last relation are continuous 
and, therefore, it holds throughout. Making now 8 variable, we find that 
¥(y) © Ci, and that g(y) = ay/am. 
Lemma 3. If g(x) C Cy and Q(z, y) C Ci,n-2, the function (20) belongs to C2. 
Proof. It will be sufficient to show that the function (23) belongs to C;. 
Relation (6) implies 


02; 02; 
oman os a 


+. O[R(a, y)). 


Hence, remembering that .Q(z, y) = A(z, y) R(x, y)* for R(x, y) < po, and 
using expression (8) for Q(z, y), we easily find that 


oq aq 
(24) S * hk 


belongs to Ci,n,-2. Hence, on account of Lemma 2, the non-trivial part of 
function (23) is the function 


(25) hy) = { (2) CEP a, 


and it is sufficient to show that the latter function belongs to C,. We restrict 
the point y to a sufficiently small neighborhood U(y°) whose closure is contained 
within a sphere R(x, y°) < p. Forming with our previous function ®,(t) the 
decomposition 


g(x) = {y(x)%,[R(z, y°)]} + {e(z) — o(z)®,[R(, y)]}, 


we see immediately that the second component of g(x) contributes to the func- 
tion (25) a component of class C,;. Hence we may assume that g(r) and ‘its 
derivatives vanish outside some coérdinate neighborhood and that y is a point 
of this neighborhood. Denoting the product of g(x) and the volume density 
g’ by $(x), we can therefore write 


ha) = ff a2) MEM doy ++ dee, 


where A is an n-dimensional Euclidean domain and ¢(z) is a function of C; 
vanishing on the boundary of A. Let y be a fixed point within A, A, the 











1e 
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difference of A and a small sphere of radius p around y, and B, the inner boundary 
of A. By Green’s formula, 


[ (x) 28 ar, +++ dt, = -| 2? Ode, ++ da, +f cos (€, v) O(E) QUE, y) dw; . 
i 21 A, OX By 


Since Q(é, y) = O(p “**), the surface integral tends to 0 as p — 0, and therefore 
h(y) = -{ = Qa, y) da, +--+ dx,. 
A 1 


By Lemma 2, h(y) belongs to C; . 

Lemma 4. If Q(z, y) C B? and L(x, y) C Bi, then Q * L belongs to B?,,_. 
ifr+s—n> 0;to Bi, foranye>0,ifr+s—n = 0;andto Boifr+s—n 
< 0. 

Proof. The lemma is known in case S is a bounded set in Euclidean space 
and R(z, y) has the value (19). For our present case it follows from the fact 
that in every coérdinate neighborhood the quantity R(x, y) is majorized by 
(19) from above and below. 

Lemma 5. If Q(x, y) C Ci,., 7 < n, then, from some positive v onward, the 
v-th iterated kernel 


(26) Q, = Q*Q*---*Q (v factors) 


belongs to C3. 
Proof. By Lemma 4 there exists a Q,_;(z, y) which is bounded. But 


Que, 0) = | Qe, 900, var = [ Qa, 906, war. 


Therefore, by Lemma 1, Q,(z, y) is continuous in x uniformly in y, and con- 
tinuous in y uniformly in x. Hence Q,(z, y) C C3. 


2. We shall now discuss the operator (21) for ¢ C H. To start with, the 
expression (20) is not defined for arbitrary elements ¢ in H. But it is defined 
for ¢ in B, and the subset B of H is dense in H. This suggests the possibility 
of extending the operator formally into a wider set of H. In fact, the operator 
(21) can be extended uniquely over the whole of H. 

Lemma 6. For any Q(z, y) C Ci,,, r <n, the expression (20) defines a unique 
operator (21) which is bounded and completely continuous. Also, for any two 
such kernels Q’(x, y), Q’’(x, y), the relation 


(27) Q'1Q"(e)] = (Q’ * Q”")@) 
holds. 

Proof. We first assume that Q(z, y) is continuous throughout. In this case 
it is known that Qg is bounded and completely continuous for gin B. B being 
dense in H, the operator Qy has a unique extension into the whole space H, 
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and this extension is again completely continuous.’ The value of Qg for an 
arbitrary element ¢ C H can be computed in the following manner. Form any 
sequence of elements {¢g,,} from B of which ¢ is the limit in the strong topology 
of H: lim || ¢ — gn |, = 0. The values ¥» = Qym can be computed from (20) 


and they are in their turn convergent to an element y of H. This element is 
the desired value (21). Finally, relation (27) is obvious for ¢g C B by Fubini’s 
theorem, and for general ¢g it follows by the limiting process we have just de- 
scribed. 

In the case of a general kernel Q(x, y) we shall apply an important eriterion.* 
An operator Qy is bounded and completely continuous in H, if corresponding 
to any e > 0 there exists a decomposition 


(28) Qe = We t+ Re 


of the following nature. R’g is bounded and completely continuous, and Q’¢ 
is a bounded operator whose bound is S «. We put 


Q’ = Q(z, y)?.[R(z, y)], R° =Q — Qe. 


R* is continuous, and hence the operator R’g is bounded and completely con- 
tinuous. As for the operator ¥’ = Q’y, we have, for g C B, 


ay vit= fv artan= ff Weare, vavede, 

where 

(30) M*(é, 0) = / QE, YQ (y, n) dy. 

The kernel M’(é, ») has the following two properties: 

(31) M*(é, n) = 0, if RE, 1) > 20; 

there exists an exponent s, 0 S s < n, and a constant A such that 
A : 

: | M’ |s ——— S 20. 

(32) | M°(é,n)| S RG” if R(é,) S 2 


Property (31) is an immediate consequence of the fact that Q°(z, y) vanishes for 
R(x, y) = o; whereas property (32) is an easy consequence of Lemma4. Putting 


| e(é) |? + | en) |? 
2 





| e(é)e(n)| S 
in (29), we obtain 


(33) 2i|y" ||’ s | o(8)| fire, n)| dey fa + [er'| f iar. n)| dg | de. 


7S. Banach, Opérations linéaires, 1932, p. 99. 
§ Banach, loc. cit., p. 96, Théoréme 2. 
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But, owing to properties (31) and (32), there exists a positive quantity «(c), 
tending to 0 as o tends to 0, such that 


/ MED dey sce), — [ (MEW dy se). 


Hence each of the integrals in (33) is S e(¢)-||¢||’, and therefore y’ is a bounded 
operator on B whose bound is S ¢(¢). The extension of ¥ from B onto the 
total space H does not increase the bound,’ and, therefore, the assumptions of 
the above mentioned criterion are fulfilled. This proves the first half of the 
lemma, whereas relation (27) can be proved as in the case of a continuous kernel 
Q(z, y). 

Part III. The Laplacian 


1. If in Green’s formula 


1 @ hy: 
34 =— dv, = 
(34) [i Zora 0 
{\‘ is a contravariant vector belonging to C], we put 
i ¢ i oy dg 
ee A eek: ae 
"=, ™ oz,” Ox, 


(35) -| gA‘gdv = ||’ “ =< + ce | dv, 
(36) [ ex'vae = [ we dv. 

From (35) we conclude 

(37) (y, 4'v) = —ely, ¢), 

and relation (36) reads 

(38) (y, AY) = (d’e, ¥). 


Hence, on the dense subspace C2 of H, the operator — A‘¢ is positive definite 
with a lower bound 2 ¢ and symmetric. By a general theorem on operators 
in Hilbert space there exists a closed extension of the operator A‘g for which the 
properties (37) and (38) remain in force.”® It will be this closure of the Laplacian 
that will forthwith be denoted by A‘g; in fact, in order to simplify the writing, 
we shall omit the upper index c, and thus write Ag instead of A°¢ until further notice. 


The domain of Ag is a well defined set D of H, C2. CD CH. 


® Banach, loc. cit., p. 58. , 
10M. H. Stone, Linear Transformations in Hilbert Space, 1932, p. 49, Theorem 2.12. 
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Lemma 7. The operator Ag satisfies the relations 
(39) Ke — ¢ = Padg, ife CD. 
(40) Ke — ¢ = APy, if Pe CD. 


Proof. By the definitions given in Part I, §§8 and 9, relation (39) is equiva- 
lent to 


(41) | [o(2)A.P(a, y) — Ase(2)P(2, y)]dos = oly). 


It is sufficient to prove it forg CC,. For other ¢ it follows from the continuity 
of the operators Kg and Py. The left side of (41) is the limit, as ¢ — 0, of 


(42) [i dona, 


where S, is the total space S minus the geodesic sphere of radius ¢ around the 
fixed point y, and 


aP(z, y) 


A(x) = o(z) — 


dp 
Zz P(a, y). 
By Green’s formula, (42) has the value 


(43) — | x@2EY aun, 


where B, is the boundary of S, and da, is the invariant surface element on B, ." 
In order to estimate (43) we introduce normal coérdinates z; = z,(é, y) originat- 
ing at the fixed point y. We also make a linear affine transformation by which 
9:;(y) = 6;;. This gives for the quantity R(é, y) the value (z;2,)'. Hence 
we obtain, by a trivial calculation, 


aR(é, y) ee 
: (2 — n)y-¢(t) —— Ra az, R= 


= (2 — nro) 4 + a 





r'(é) 








Therefore, (43) has the value 


(44) (n — 2)ywrrg(y) + o( =). 


where w,—; is the (2 — 1)-dimensional Euclidean volume of the unit-sphere 
2:2; = 1. Substituting from (14), we see that (44) tends to g{y) as « — 0, 
and this completes the proof of (39). 


1 W. Feller, loc. cit., §2;S. Bochner, Remark on the theorem of Green, this Journal, pp. 
334-338. 
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Relation (40) will be proved if we show that for every element u of C2, 
(45) (u, Ke — ¢ — APe) = 0. 


Obviously 


(u, Ke) = i u(x)e(y)K(z, y)dv.dvy = (y, Ku) 
and ) 
(u, APe) = (Au, Pe) = [ Seu(2)-e)PQ, 2)doede 
= (y, PAu). 
Therefore the left side of (45) has the value 


(y, Ku — u — PAu), 
and this vanishes by (39). 


2. Lemma 8. If Q(x, y) C Cin-2, f CC2,¢ CH, the relation 


(46) ¢=Qe+f 


implies that p C C2. 
Proof. Iterating (46), we obtain, using relation (27), 


(47) ¢=QA¢+Q5nf + Q0f +--- +f. 
Obviously, the functions Qf, Qf = Q(Qf), --- , af = Q(Q,-2f) are all bounded. 
From a sufficiently high »y onward, Q,(z, y) C B® by Lemma 5, and hence by 


| P 
(48) | / Q.(2, ye(z)de.| < / Q(z, y)*dve / o(2)* doz, 


Q,¢ is bounded. Thus, by (47), ¢ is bounded. Consequently by Lemma 2, 
Qe CC, and hence by (46),¢ CC,. But then, by Lemma 3, Qe C C2, and 
therefore, again by (46), ¢ C C2. 


19 





3. Lemma 9. The equation 


(49) (Ap =)Ag =f 
has a solution for any f C H. 
Proof. A solution, if any, is unique, by (37). It is sufficient to show that 


(49) has a solution for f C C; ; in fact, if we show this, we prove that the operator 
f = Ag has an inverse 


¢ = Jf 


on a dense set in H. By (37) this inverse is a bounded operator and has there- 
fore an extension into the whole of H. If we re-invert the fully extended opera- 
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tor Jf we obviously obtain Ag; in other words, the counter-domain of Ag is the 
whole space H. 

Substituting (49) in (39), we are led to investigate the integral equation 
(50) Ke — ¢ = Pf. 
The operator Ke being completely continuous by Lemma 6, we can apply 
the generalized Fredholm theory.” According to this theory, the equations 
(51) Ke — ¢ = 0, 
(52) Ky —y=0 
have the same finite number of independent solutions. If we denote such basic 
solutions by 


P1y °** > Om, is. *** sins 


for a given element f the equation (50) has a solution, if, and only if, 
(53) (v., Pf) = 9 (u = 1,---,m). 


We remark once for all a fact which we shall tacitly use several times, namely, 
that any solution ¢ or y of (50), (51), (52) is automatically a function of C2, 
and therefore a function of D. This follows from Lemma 8 [in conjunction 
with Lemma 3 for equation (50)]. Let yg, be any solution of (51). Combining 
(51) and (39), we obtain 


(54) PdAg, = 0. 
Hence for ¥, = Ag,, we obtain from (40), Ky, — y = 0. Also, if gi, --- , om 
are linearly independent, then Ag; , --- , Ag» are likewise linearly independent. 


Hence, the basic solutions of (52) can be assumed to have the form y, = Ag,. 
Now, (&, Pf) = (f, Pv), and vanishes by (54). Thus equation (50) has a 
solution g* for an arbitrary element f C C,. Comparing now relations (50) 
and (39), we obtain for the function ¥* = f — Ag* the equation Py* = 0. By 
(40), ¥* is a solution of (52) and may be written in the form c:yi + +++ + mbm. 
In other words, the function g = ¢* + c1¢g1 + +++ + Cm@m is a solution of (49). 

TureoreM I. The operator f = Ag — cg is for every fixed c > 0 the inverse 
of a completely continuous operator g = Jf in H. The solutions ¢ C H of the 
equation 


(55) Ay = he 


(A being any constant for which a solution exists) form a complete set of functions 
in H. They are all, automatically, contained in C2. 
Proof. From (50) we obtain 


¢ = K(f) — Pf = Jf. 


2 Banach, loc. cit., pp. 159-161. 
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By Lemma 6, Pf is completely continuous, and so is A(J/f), since Jf is at any 
rate bounded. Therefore, Jf is completely continuous too. Hence, Jf has a 
pure point spectrum and so has its inverse A‘yg. Therefore, the solutions of the 
equation Ag — cg = hg, or, what amounts to the same, those of equation (55) 
form a complete basis in H. Substituting (55) in (39), we obtain (K — AP)(¢) 
= gy, and therefore ¢ C C2 , by Lemma 8. 


Part IV. The mapping theorem 


1. The following theorem makes full use of our assumption that the under- 
lving space S is analytic. 

THEOREM II. The solutions ¢ of (55) are all analytic. Thus there exists in 
the function space H a complete basis {¢»,| whose elements ¢,, are analytic through- 
out S. 

Proof. The analytic character of the functions ¢,, is a purely local property 
and follows from the following theorem which for n 2 3 was first proved by 
J. Hadamard.” 

Lemma 10. [If the coefficients g;;(x) (the tensor g;; being symmetric and positive- 
definite), bi(x), e(x), f(x) are all analytic in the neighborhood of a point x° = 


(xi, «++ , 2), then every solution g(x) of the equation 
ay dg 

56 a + te — y= 

(56) 9 dr.ax; + mn? f 


which belongs to C2 is likewise analytic. 

Proof. Since Hadamard does not carry out the last steps of his argument 
we shall give a brief summary of it. By a fundamental theorem of 8. Kowalew- 
sky equation (56) has always analytic solutions (locally). Hence it is sufficient 
to prove our theorem for f(z) = 0. We denote for the moment the operator 
(56) by Fg and its formal adjoint by F*¢, and we consider Hadamard’s elemen- 
tary solution G(z, y) belonging to F*¢. G(z, y) is a function of the form 


U(x, y) . ; 
(57) ——? “"_ + log R(z, y)-V(z, y); U(x, z) = 1, 
Y Re, y= + log y y 
the coefficients U(x, y), V(x, y) being analytic in all 2n variables 2, --- ,2,, 


Yi, °°", Yn; and F2G(z, y) = 0. For a sufficiently small spherical surface B 
our solution g(x) of Fg = 0 satisfies a relation 


. aG 

(58) el) = f (oe) FEY + ocercts,y)) de 

for every point y interior to B; the coefficients p(é) and o(£) are continuous, 
and we shall not need their precise value. This is a consequence of Green’s 
formula, in which the volume integrals vanish because g(x) and G(z, y) are 
solutions of F = 0 and F* = 0, respectively. The term ¢(y) emanates from the 


13 Lectures on Cauchy’s Problem, 1923, p. 102. 
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singularity (57) in just the same way as in the case of relation (39). In fact, 
the symmetry of the function P(z, y) has not been used at all for the derivation 
of (39), and a logarithmic term of the form log R(x, y)-V(zx, y) would not alter 
the estimate (44) for n = 3. 

Now let A be any closed point set interior to B. For y C A, é C B, RE, y) 
has a positive bound from below; hence we can find a region A in the space 
of the complex variables y,, --+ , y, over which R(é, y) is still analytic and has 
a real part which has a positive bound from below. Since U(é, y) and V(é, y) 
can also be continued into a complex y-neighborhood of the origin, it follows 
immediately that the function (58) is analytic in some neighborhood of the 
origin. 


2. We again choose a fixed constant c > 0, and denote by g = Jf the inverse 
tof = Ag and by J,f the v-th iterate of Jf. 

Lemma 1l. There exists an integer v = 1 and a constant A such that for every 
f CH the relations 


(59) | JS(z) | = A-|I FI 
(60) | grad J, f(x) | = A-||f || 
hold. 


Proof. We write fo = f, fur = Jf, 4 20. By (50) we have the relations 
fi=Kfi-— Ph, f2=Kfhe— Phi, fs = Kfs — Phe, 
ete., from which we deduce the relations 
fi = Kf, — Pho, fe = KKf. — (KP + PK)fi + PPh, 
fs = KKKf; — (KKP + KPK + PKK)f2 + (KPP + PKP + PPK)fi — PPPhy, 


IV 


ete. In general, 
(61) j A = Qwf, + Qn v—l1 + sta + Qwho; 


the functions Q,,(z, y) being “polynomials” of »-th degree in the functions 
P(x, y), K(x, y). Since P and K both belong to B%_2, there exists by Lemma 4 
a vy = 1 such that the functions Q,, belong to B’. We choose a » with this 
property and hold it fast. Since Jf is a bounded operator, there exists an a > 0 
such that for every f C H 


| fu(z) || S oI Ff || (u = 0,1, ---,»). 
Therefore, by relations (61) and (48), there exists an A > 0 such that 
\f(z)| S$ A-||f\l- 


This proves (59). Raising the exponent v by 1, we can also assume (for some 
other A > 0) 


\f-a(z)| S A-\If ll. 
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Relation (60) follows now from 


j. _ Kf, ee Phi 


by Lemma 2. 


THeorREM III. Jf t(x) is any function on S belonging to C,,, corresponding 
to any « > 0 there exists an analytic function (x) on S such that 


(62) | t(z) — ¥(z)| S «, | grad [t(z) — ¥°(z)] | S «. 


Proof. Applying the operator A‘ v times to the function ¢(x), we obtain a 
function g(x) for which t(z) = J,¢(x). By Theorem II there exists an analytic 
function ¢g‘(x) such that ||~ — ¢°|| S «/A. The function ¥‘(z) = J,¢‘(z) 
is again analytic by Lemma 10, and relations (62) hold by Lemma 11, if applied 
to the function f(z) = ¢(x). — ¢‘(2). 

Theorem III completes the proof of the 

MappinG THEOREM. Any compact analytic Riemann space S, can be mapped 
topologically-analytically onto the Euclidean Eon, . 


Part V. Tensors 


If we use the mapping theorem it can be easily shown that not only the Hilbert 
space of scalars on S has an analytic basis but that the same is true for the 
Hilbert space H of tensors of any given rank. As an illustration we shall 
consider tensors of rank 3, ¢ = ¢gasy. The inner product is, of course, defined by 


(63) (¢, ) bes | eanuao = [ oP Vande 


Given a tensor gasy, We form with the functions (1) the scalars 


at, dt, at 
A i = apy Re Ne ul 
(op, 0,7; x) = g™ (x) ak 





(p, ¢, 7 = 1, 2,---,2n + 1). Conversely, in every coérdinate neighborhood 
on S the tensor components ¢**” can be expressed as linear combinations of the 
functions A(p, o, 7; x), the coefficients being rational functions of the partial 
derivatives a. Since the functions A(p, 7, 7; xz) are limits in square mean of 
analytic functions, the tensor gag, is likewise a limit, in square mean, of analytic 
tensors. 

Ancther more illuminating way of treating tensors is to set up a Laplacian 
for them and to investigate it as in the case of scalars. The definition of the 
Laplacian is 


Ag = 9” Paby.p.9 « 
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Putting 


as 
vag = "79" Pasy.0.0 
v grad ¢ = YG” Gasy.p 5 
Viy, ¢) = fe ane: 


we have the formal identity 


Ag = div (y grad ¢) — VI, ¢) 
which leads again to (36), (37), (38). In order to have an analogue to (39) 
and (40), the quantity P(x, y) must be a tensor of rank 3 in both the variables 


x and y separately. Everything goes through strictly analogously if we replace 
definition (15) by 


7y®,[R(z,y)]) #2 aa aa 
R(x, y)"* Ara dyn 0X8 dy, OX, IY, 





P(2, y) = Papydut, y) = 


The analogy is immediate up to Lemma 10. This lemma is to be replaced by 
the following generalization. 
Lemma 12. If in a system of equations 


i CraN 
64 oF + bai — = 
(64) 9 az, az, + bai or + Carer = fu 
(i,j =1,--*,n;\,u = 1, +--+ ,m; nand mare arbitrary integers) all given func- 
tions are analytic, then every solution ¢,, +--+ , @m which belongs to C2 is also an- 
alytic. 
Proof. We introduce auxiliary variables t,,---,t,. Putting (2, t) = 


t,¢,(z) and multiplying (64) by ¢,, and summing with respect to u, we obtain 








i;., &o a ab 
65 a tubuas —— +! — =t,f,. 
( ) 9 (x) O02; 02; + — 02; Oty + wear ot, Si 
® a ® 
Since oh at, = 0, we can add to the left side of (65) the term 6, Bh dl’ If we 


consider @(z, t) as a function in the n + m variables 7, +--+ ,%nj3t1, ++, tm, 
relation (65) has the form (56), and Lemma 10 applies for sufficiently small 
values of t, since for small values of ¢t the matrix of the coefficients of the second 
derivatives is again positive definite. Therefore (z, t) is analytic, and so are 
the functions 


oP 
¢u( x) = at, : 
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POLAR CORRESPONDENCE WITH RESPECT TO A CONVEX REGION 
By Fritz JoHn 


We denote as polar correspoadence (abbreviated P.C.) with respect to a 
convex region FR in projective n-dimensional space 7, any one-to-one corre- 
spondence of the points of R and the hyperplanes outside R. The study o 
such a correspondence is essentially the study of a contragredient vector with 
special consideration of the convexity of the domain of definition. In §1 of 
this paper the representation of a P.C. in homogeneous coérdinates is discussed. 
A P.C. is called positive if a point and its polar plane are not separated by any 
other point and its polar plane. In §2 it is proved that every positive P.C. is 
continuous. §§3-4 deal with symmetric P.C.’s; a P.C. is called symmetric if 
in the neighbourhood of every point P it is approximated by an ordinary polar 
correspondence with respect to a quadric, denoted as the tangential quadric in P. 
A positive symmetric P.C. may be generated by a convex hypersurface in 
(n + 1)-dimensional space 7,4; in such a way that the line joining any point 
Q of the surface to a fixed point of 7,4; and the tangential plane in Q intersect 
7, in a point and its polar respectively. 

In the remainder of the paper a general class of P.C.’s is discussed, which are 
generated by continuous positive mass distributions on R. Given any hyper- 
plane p outside R, the pole of p shall be that point which becomes the center 
of mass of R with the given mass distribution, in case p is chosen as plane at 
infinity. In §4 it is proved that a P.C. generated in this way is always positive 
and symmetric. In §$§5-6 it is shown that the tangential quadric at a point 
P is identical with Legendre’s ellipsoid of inertia of R if the polar of P is plane 
at infinity. Moreover, some inequalities involving FR and its tangential quadrics 
are given.’ 


1. Let R be an open convex region in projective n-dimensional space 7, ; 
i.e., an open set with the following properties: 

(1) If P and Q are points of R, one of the two straight line segments bounded 
by P and Q belongs to R; 

(2) If S denotes the set of points which are neither points of R nor boundary 
points of R, there is at least one hyperplane in S. 

DeriniTion. A one-to-one correspondence between the points of R and 


Received October 23, 1936. 

! I am indebted to the referee for pointing out that the methods used in §1 are closely 
related to those used by Steinitz in his paper Bedingt konvergente Reihen und konvere Sys- 
teme in Crelle’s Journal; cf. in particular vol. 146, p. 32 et seq., where Steinitz deals with 
convex regions in projective space. Our sets p and fj appear there as number sets A and 
—A, and theorems corresponding to our Theorems 1.7 and 1.8 are given. 
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the hyperplanes of S is called a polar correspondence (abbreviated P.C.) with 
respect to R. If a point P and a hyperplane p are corresponding elements, 
then P will be called the pole of p and p the polar of P. 

Let z, be referred to homogeneous coérdinates 2, +--+ ,2%n4:. Let p denote 
the set of ordered sets x = (x, %2, +++ , Yn41) corresponding to points of R, 
and let o denote the set of ordered sets u = (uw , We, -** , Unsi) Corresponding to 
hyperplanes uetq = 0 contained in S.’ 

For every x C p our P.C. uniquely determines, but for a common factor, 
a u Co; we assume in what follows that this factor is chosen in such a way. that 
for a point z and its polar u the relation 


(A) Uele = 1 

holds. This is possible, as obviously uet%_ # 0. The wa then become uniquely 
determined functions ue = Ua(x) of x = (41, «++ , Xn41) for x C p in the given 
P.C. 


The following properties of the wu. follow immediately from our assumptions: 
1.1. alti, +++ , Zn41) ts homogeneous of degree —1. 
1.2. For every u C o the equations u.(r) = Ua (a = 1,---,n + 1) havea 
unique solution x C p. 
1.3. If a new codrdinate system is introduced by x = Gasxg the ug undergo the 
contragredient transformation ug = Cag Ua « 
14. Ifx Cpandy Cop, then ua(x)ya ¥ 9. 

The convexity of R finds its expression in the following theorems. 
1.5. Ifx Cp, y Cp, the equation 


Ua(z)(ALa + Ya) = O 


has either for no z of p or for every z of p a solution with \ > 0, u > 0. 

Proof. x + uy gives for \ > 0, u > 0 one of the two segments bounded by 
zx and y; according as this segment is contained in R or not it will be intersected 
by no or by every hyperplane contained in S. 

1.6. Forz, y,z Cp, 
(1) Ua(2)%a°Us(r)ys*Uy(y)zy > 0; 
in particular for y = z, 

Ualy)Za°Us(x)ys > 0. 

Proof. Let x and y be such that 

Ua(z)(Ata + wYa) ¥ O 


forallz CpandA’\ >0,u>0. As ua(z)(Ata + Ya) represents a continuous 
function of \ and uy, it will have the same sign for \ = 0, 1» = 1 as for \ = 1, 
uw = 0. Thus 


(2) Ua(Z)Xq°Us(z)ys > 0 


2 Throughout this paper the summation convention is used, the letters a, 8, y, --- rang- 
ing over 1, --- ,n + 1, the letters 7, k, 1, --- ranging over 1, --- , n. 
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for allz Cp. In particular, for z = x orz = y, 


(3) Ua(r)Yya > 0, us(y)xs > 0; 
hence 
(4) Ua(r)YaUs(y)xs > 0. 


If, on the other hand, z and y are such that 
Ual(zZ)(ATa + UYa) = O 
for every z C pand some \ > 0, u > 0, we may conclude similarly that for every 
zCop 
(2’) Ua(Z)La*Us(z)ys < 0 
(3’) Ua(Z)Ya < 0, us(y)zs < 0, 


so that (4) holds. As (4) is proved for every two elements of p, we also have 


Us(z)ys-Uy(y)zy > 0. 


Thus from (2), (3), (4) or from (2’), (3’), (4) follows (1). 
Derinition. Let 2° be a fixed element of p. By f we denote the subset of 
all elements y of p for which 


Ua(z ya > 0. 


Obviously for every y C p either y Cpor —y Cf. It follows immediately 
from 1.6, that 
1.7. If x Cpand y C4§, then 


Ua(r)Ya > O. 


Thus A may just as well be generated by any other one of its elements as by 2”. 
18. Ifx Cpandy C§, then dx + py Cpforr > 0,u > 0. 
1.9. Letzx” Cpforvy = 1,2, ---andlimz’ = 2 Cop. 


vo 


Then there exists a subsequence of the u(x’) which converges to some (u) ¥ 0. 
Proof. There is certainly a subsequence of the u(x’) for which 


v = u(z’) [Xo u(x’)} + 


converges toward some v ~ 0. As u,(x’)a, = 1 (v not summed), we have for 
this subsequence 


vt, = > u2(2’)}, Vata = lim [> ui(2’)}?. 


The limit on the right side must be ¥ 0, for the plane vzy. = 0 cannot contain 
an interior point of R, as it is the limit of exterior planes of R. Thus [> ui (x”)}* 
converges for that subsequence towards a limit ~ 0, and therefore the cor- 
responding u(x”) converge toward some u + 0. 

1.10. pis an open set. 
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Proof. If x Cf and x = lim 2’, then lim ua(x)z, = ue(x)ra = 1. Thus 


yo ven 


Ua(x)r, > 0 for sufficiently large v;i.e., x” C p for sufficiently large v. 


2. Derinition. The P.C. is called positive if for any x C p, y C p and 
xuFy 
(6) Ua(Z)Ya*Us(y)xs > 1. 
(Compare with 1.6.) 

Let P and Q be the points of R corresponding to x and y; let P’ and Q’ be 


the points of intersection of PQ with the polars of P and Q, respectively. Then 
the cross ratio (PP’/QQ’) is given by 
ae 1 
Ualy) La" Us(x) Ys” 

Thus (6) means that the pair of points PP’ is not separated by the pair of points 
QQ’. 
2.1. Ina positive P.C. the functions ua(%1, +++ , Xn41) are continuous. 

Proof. Let 2° C p for vy = 1, 2,--- and let lim 2” = x Cp. We have to 


vos 


prove that lim u(x’) = u(x). According to 1.9 a subsequence of the u(z”) will 





converge towards some v # 0. We restrict ourselves to the consideration of 
that subsequence, which may again be denoted by u(z’). Our statement will 
be proved if v = u(x). Let us assume that lim u(x”) = v ¥ u(x). There are 


two possible cases: (a) v C o, (b) v is on the boundary of ¢; i.e., corresponds to 
a plane of support of R. 
We deal first with the case (a), where lim u(x”) = v Co. Thereisay Cp 


such that »v = u(y), and y # 2, since u(y) # u(x). Without restriction of 
generality we may assume that x C p. Then according to 1.10 all but a finite 
number of the x”, which may be neglected, are contained in p. From lim 2” = z, 


yee 


lim u(x”) = u(y), and u.(2")r, = 1, it follows that 


Ualy)ta = 1; 


therefore y Cp. Moreover, for any z C p, 


IV 


Ua(z)r,-Ug(x")z3 = 1; 


thus forvy > «, 


IV 


Ua(Z)La*Us(y)Ze :. 


Let z = pr + Ay, where A > 0,4 > 0. Thenz Cfand 
1 S [way + ux) ra][us(y)(Ay + uars)] 


7) 
; [wa(ua + AYy)®al(u + A). 
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On the other hand, 


Ua( wx + Ay) La = Ua( mx + Ay) (ura + Aya) = * alae + AY) Ya 


Ua(ux + AY)Ya 


1 





1 
Ku 
1 
K 
<! 
m 


by use of (6) and the fact that wa(y)(ura + AYa) > 0, since zandy Cp. The 
right member is 


B.. a 1 1 


ho pet pr 
But the last result contradicts (7). 
In case (b), we have lim u(x”) = v, where vaya = 0 is the equation of a plane 


von 





of support of R (or of p). Then there exists a y on the boundary of p such that 
VaYa = 0; y may be chosen in such a way that u.(x)y. > 0. Then for A > 0, 
n> 0, 


Ua(Z) (uta + Ma) = w+ AUa(Z)Ya > 0, 
i.e., ux + Ay C ffor all positive u, A. Then 
Ua(ux + Ay)(mi Ta + MYa) > O, 
if uw, and \, are positive. Thus for u, — 0 we have 
(8) Ua(ut + Ay)Ya > O, 
since Ua(ux + Ay)Ya = Vis not possible. On the other hand, it follows from 


Ua(ux + Ay)a-Us(z")(uas + dys) 2 1 


IV 


forv— that 


— 


Ua(ux + AY)La(uXs + Ays)vs = 
AS Vala = 1, VaYa = 0, we have 
Ua(ur + Ay)ura 21; 
subtracting this from wa(ux + Ay)(ura + AYa) = 1, we obtain 
Ua(ux + Ay)Ya SO, 
in contradiction to (8). 


3. Let the P.C. be positive. Let 


(9) U(e, y) = | aa{(l — De + QMye — 20a. 
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U(x, y) is defined for x C p and y C §, as then also (1 — t)r + ty Cp for 
0 < t S 1, according to 1.8. 


3.1. U(x, y) = —Uly, 2), U(x, Ay) = u(, ). 


DerFINITION. A positive P.C. is called symmetric, if for all x, y,z CA 


(10) U(z, y) + Uy, z) + Ul, x) = 0. 
3.2. Ina positive symmetric P.C. 
aU (x, y) = Ua(y). 
OYa 


Proof. From (10) we obtain for z; = ys + 62h 


U(a, z) — U(z, y) = Uy, 2) = [ Ualy + (z — y)tlhadt 
or 


fn U(x, z) — U(a, y) 
h—0 h 





= lim [ Ualy + (z — y)tldt = ualy). 


3.3. A positive P.C. is symmetric if there exists a function u(x, +++ , Xn) > O 
defined in p such that 


d log u 
11 ——— 
(11) a 


Proof. If there is a function u satisfying (11), then 


(2. y) = log YY 
(12) U(z, y) = log ula)’ 
and (10) will be obviously satisfied. If on the other hand (10) holds, the fune- 
tion u(y) defined by 


(13) u(y) = e u(x) (u(x) > 0), 


where z is some point of p, will satisfy (11) according to 3.2, and will be positive. 
It follows from (12) that all solutions u of (11) can only differ by a constant 
factor and are given by (13). 
3.4. The function u(x) is homogeneous of degree one. 
Proof. Let u be given by (13). According to (10) for \ > 0, 


u(rAy) = el ™ u(r) _ eV FW +UodW (7) “ss u(yer™, 


Now from 1.1 and (9) it is easily deduced that U'(y, Ay) = log X. 

Let ns: denote the projective 2; +--+ 2n41 y-space and let Y be the point 
ti = 2 = +++ = Laisa = O and z, the plane y = 0. Then the equation y = 
u(%1, °** , Xn41) for x C f defines, according to 3.4, a certain surface 2 in m,4:. 
As u has continuous derivatives, = has a continuously changing tangent plane 
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at every point. Moreover, for every point Q of = the line YQ intersects y = 0 
in a point of R. 
3.5. IPf Q is a point of =, then the point of intersection of the line YQ with x, and 
the (n — 1)-flat of intersection of the tangent plane in Q with x, are respectively 
pole and polar in our P.C. 

Proof. The intersection of the tangent plane in Q with y = 0 is given by 


au 
‘a az. = VU, 
or according to (11) by ua(x)ya = 0. 
3.6. > is a convex surface with only regular points and planes of support.’ 

Proof. Let P; and P, C R. The 2-flat through P; P2Y intersects = along 
some curve L. From the geometrical interpretation of the positiveness of the 
P.C. (ef. the definition, §2) it follows that L is a convex curve; for if P;, P2 are 
the points of intersection of the line P; P2 with the tangents of L in the points of 
intersection of P,Y and P:Y with L, then P, P{ and P2P; do not separate each 
other. = is a convex surface, as it is intersected in a convex curve by every 
2-flat through Y. The regularity of the points and planes of support of = 
follows immediately from the one-to-one character of the P.C. 

The following inverse statement is easily proved: 

3.7. Any surface = in mp4, will generate a positive, symmetric P.C. with respect 
to the convex region R in x, in the manner of 3.5, if 

(a) for every point Q of = the line YQ intersects x, in a point of R; 

(b) = together with R and the boundary of R forms the boundary of a convex 
region in Wir} 

(c) = has no points in common with rp ; 

(d) = has only regular points and planes of support. 


4. Throughout this section we assume that the P.C. is positive and that the 
functions u,(x) have continuous first derivatives. We put 


Derrinition. The correlation in which a point £ corresponds to the hyper- 
plane given by 


(14) [uas(x) + 2u4(x)us(x)] Ea a” 0 
will be denoted as the tangential correlation at z. If we write 


(ze) _ [Usalz) + 2ua(x)us(x)] fs 
(15) val) = i a(@) + Quy (w)eg(x)] Ey ba’ 





3 Regular points are points through which only one plane of support passes; regular 
planes of support are planes containing only one point of contact. Cf. the definition in 
Bonnesen and Fenchel, Konvere Kérper, pp. 13-15. 
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the plane va(~)n« = 0 corresponds to the point & in the tangential correlation 
and v,()& = 1. 
4.1. (a) va(z) = u(x), i.e., the polar of x is the same for the P.C. and the tan- 
gential correlation at (x); 
(b) va(é)ta = 0 implies 0 = va(x)Ea, t.€., in the tangential correlation the 
poles of planes through (x) lie on the polar of (x); 
(ce) vas(x) = (=) = Uaa(2), 2.e. the tangential correlation gives the best 
B sims 
approximation to the P.C. in the neighbourhood of x. 
Proof. From 1.1 and (A) it follows that 


Uas(X)La = Uga(L)La = —Us(Z)," Uag(X)%aXs = —Us(x)rg = —1. 


4.2. When ua has continuous derivatives, the P.C. is symmetrical, if and only if 
Uas(t) = Usa(x). 

Proof. U(x, y) is the line integral of the function u over the straight line 
segment joining the points z and y of j. 

Since for wag = Usa the tangential correlation (14) is the polar correspondence 
with respect to a quadric, we see that a P.C. is symmetric if in the neighborhood 
of a point it can be approximated by an ordinary polar correspondence with 
respect to a quadric. This quadric, which is given by 


[Was(x) + 2ua(x)us(x)lEats = 0, 


will be called the tangential quadric at x. It follows from 4.1 that 
4.3. If = is the surface in 1,4: generating a symmetric P.C., the tangential quadric 
at a point P of R is the intersection of +, with that (n + 1)-dimensional quadric 
which (a) has a contact of the second order with > at the point of intersection of PY 
with > and (b) for which Y is the pole of the plane r,, . 
4.4. If the tangential quadric at x in a positive symmetric P.C. is non-degenerate, 
it has no point in common with the polar of x. (This implies that the tangential 
quadric 3s projectively equivalent to an ellipsoid.) 

Proof. According to (6), 


Ualt)La* us(x) Es 2 1 ; 


in particular for § = x + dz, if we neglect terms of higher than the second order 
in dz, 


1 S [wa(x + dr)xa)[us(x)(xs + drs) 

= [1 — ua(x + dx)dzq\[1 + us(x)drs) 
[1 — we(x)dtq — Uay(x)dradry + ---)[1 + us(x)dzz] 
1 — [ua(x)dra}’ — tay(x)drqdx, + -:- 


Consequently for any dx = 7, 


[was() + 2va(x)us(x)] nans S [wa(x) nal - 
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From this inequality it follows that, provided the quadric 
[was(t) + 2ua(r)us(z)] nans = 0 


is non-degenerate, it can have no points in common with the plane u.(x)n. = 0. 


5. Let the convex region R be covered with mass of density 4. We assume 
that u is a continuous positive function in R and on its boundary, and that yu is 
invariant under collineations. 

Derinition. Let p be any hyperplane of S. If p is taken as plane at 
infinity in a non-homogeneous coérdinate system, R will have a certain center 
of mass P under the given mass distribution. P will be called the pole of mass 
of p and p the polar of mass of P. 

If an ellipsoid or a simplex is covered with homogeneous mass, the pole of 
mass of a plane coincides with the pole in the ordinary definition. 

As a consequence of the well-known theorem that the center of mass of a con- 
vex region with a positive mass distribution lies in the interior of the region, 
and is invariant under affine transformations, we have: 

5.1. Every plane of S has a uniquely determined pole of mass, which is contained 
in R. 

In-this and the following paragraphs we shall always refer R to a non-homo- 
geneous coordinate system 2, , --- , 2, , the plane at infinity being a plane of S 
and the origin a point of R. Let gi; = 1 be the equation of a plane p of S. 
In order to calculate the pole of mass P of p, we introduce new non-homogeneous 
coordinates 

, Zi 


4 = —— (¢=1,---,n) 
1 — qer 


such that p becomes plane at infinity in z’-space. If &, ---, & denote the co- 
érdinates of the center of mass of RF in z’-space referred to zx’ coédrdinates and 
&,,---, & the codrdinates of the same point referred to the z-system, we have 


[ pr, dz, +++ dri, 








f; alee ’ 
. [ pdel +++ dal 
7” 
or in the z-system, 
zs 
Y d . 
é, Sos J dx; x 
1 a — ’ 
be [ nsan dzxp, 
R 
when 
_ A(t, +++, tn) 1 





k 
7 a(n, sikeleae | Xn) 9 (1 ont qx1)?" | 651 ma Qm Xm) + ri | 


= (1 = nx)". 
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Thus 


é, [asc — gai)” day +++ dx, 
(16) ——— = t 


i- ” 
Gabe [ua — qx) "dx, +++ dz, 
R 





Let us write 


(17) FQ, *** 5) = (| u(l — mai) "dx, ++: dz) 7, 
R 


If F; stands for dF /dq;, (16) may be written 
(18) ro ee 


Solving for £;, we finally obtain 


(19) t= 


F(q, *** , Qn) is defined for all (q:, --+ , dn) for which q,2, < 1 for all (x, 
-,2,) C R;i.e., for all q contained in the polar region R’ of R with respect 
to the unit sphere. F gives essentially the mass of R in any coérdinate system. 


5.2. F(q) = F(q, +++ , dn) is strictly concave for gq C R’. 
Proof. We have to prove that for 0 < 3 < Landq’ # q” 


F[dq’ + (1 — 8)q"] > 8F(q’) + (1 — 8)F(Q"). 











Let 
1 ee 121 1 qi t1 
a(x) = —" i) = 2S. 
n+l 1 
M m 
iia += (8g: + a = O)qu dar 
n+l 
Me 


Then c(x) = da(x) + (1 — #8)b(x) and 


1 


F{dq’ + (1 — dq") = | [ c"*) (2) dar ++ az, | 
R 


- if [va + (1 — vb] "*? day, - 


> if [da] “"*” dx, --- ie.) +} if [a — v)do]-"*? da, --- 


= 0F(q') + (1 — 8) F(q”) 
according to Minkowski’s inequality.‘ 


‘Cf. Hardy, Littlewood, Pélya, Inequalities, p. 146, Theorem 198. 
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Obviously the equation F = F(q, --- , gn) will be the equation of an open, 
concave surface >’ in q, --- q,F-space lying above the region R’. Every 
parallel to the F-axis through a point of R’ will intersect 2’ in a single point. 
As F is strictly concave, only one point of >’ will be on every plane of support 
of =’. As F has continuous derivatives, there will pass only one plane of support 
through every point of =’. As u > 0 on the boundary of R, F will tend towards 
0, if (q1, --- ,@n) approaches the boundary of R’. Thus >’ together with R’ 
and its boundary will form the boundary of a convex region in q; --- g, F-space. 
Hence through every (n — 1)-flat of F = 0 outside R’ will pass one plane of 
support of 2’ having a point of contact on L’ (not on its boundary). 

Let = be the polar reciprocal of >’, i.e., the surface consisting of the poles of 
the planes of support of =’ with respect to the unit sphere.’ = will be given in 
tangential coérdinates by the equation 


(20) qiki + nF(M, +++, 4) = 1 (qq CR’), 


&,---,&, » denoting the codrdinates of a point of a tangential plane of >. 
By dualizing the previous statements about >’, we see that = will be a concave 
surface with only regular points and planes of support; through every (n — 1)- 
flat outside R will pass exactly one plane of support of = having one point in 
common with =. Every parallel to the 7-axis through a point of R will intersect 
y, ie., 2 lies above R. If q;& = 1 is any (n — 1)-flat p in » = 0 outside R, 
the coérdinates of the point of contact of the plane of support of = through p 
are given by (20) and the equations 


(21) & + nF, = 0 (k = 1,---,n). 


From (20) and (21) we find (19). 

Accordingly, the projection of the point of contact is the pole of mass of p. 
Hence the correspondence between poles of mass and polars of mass is generated 
by the surface = in the manner of 3.5, if the point at infinity of the y-axis is 
taken as Y. Thus according to 3.7: 

5.3. The correspondence between poles of mass and planes of mass is a positive 
symmetric P.C. 

Incidentally we have proved that every point of R is pole of one hyperplane 
in S; in other words, that for every point P of R there is a collineation such that 
the image of P becomes center of mass of the transformed region. This might 
have been more directly concluded from (19) and 5.2 in the following way. 


It is sufficient to prove that the origin & = & = --- = & = 0, which was an 
arbitrary point of R, is pole of mass of some plane qg,z, = 1. Thus according 
to (19) we have to prove that the equations F; = 0 (¢ = 1, --- , n) have a solu- 


tion, i.e., that F is stationary for some q C R’. Now, as F is a concave positive 
function in R’ and F = 0 on the boundary of R’, F will have a maximum in R’; 
this maximum will be attained at only one point of R’, as F is strictly concave. 
Thus the origin will be pole of mass of exactly one plane. 


> Cf. Bonnesen and Fenchel, loc. cit., p. 28. 
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Let us calculate the tangential quadric of the P.C. (19) at a point P of R with 
coérdinates (23, ---,2%). Let qfy; = 1 be the equation of the polar p of P. 
The tangential quadric Q at P is according to §4 characterized by the conditions 
that the codrdinates z; of a point and their first derivatives x with respect 
to the coérdinates q, of its polar plane have for q = q° the same values under the 
polar correspondence (19) and under the polar correspondence with respect to Q. 
In order to simplify the calculation we assume that P is origin and p plane 
at infinity of our non-homogeneous coérdinate system. Then zx} = 0, qh. = 0. 
Q is given in tangential coérdinates by 
(22) TKQidk = 3 


. 


For the pole (2, --- , ,) of a plane qg;y; = 1 with respect to the quadric (22) 
is given by 


Li = LikQk 5 


hence for gq = @ = -*: = qn = 0 
z; = 0, ae = Tix 
04% 
According to (19) we have for q; = g2 = +--+ = qn, = Oanda, = --- =z, =0 
F; = 0, pen et 

aq% FP 0% 

Thus the tangential quadric is given by 
‘ai% + F = 0. 





Og 


Substituting for F its expression (17), we obtain for g; = 0 
1 


F= (/ pda, dieiie a2), oF = —(n + gee | px r,dx; es dx. 
R 0" R 





Thus the equation of Q in tangential codrdinates becomes 


(23) [was +++ dz, = (n + 2) [ w(xqi) dx, +++ day. 


Q is by definition covariant under affine transformations. Thus we may 
assume, without restriction of generality, that Q is the unit sphere g; + 93 + --- 
+q, =1. In that case we must have 


(24) (n + 2) [ prj r,dxz, +++ dz, = at | pdx, +--+ dz,. 
R R 
These equations may be interpreted as stating that the moment of inertia of R 


with respect to any axis is the same as that of the homogeneous unit sphere 
of same mass as R. Since the property that two regions have the same moment 





ee 








y 





aia mw NET te Ne oe 





POLAR CORRESPONDENCE 367 


of inertia with respect to every axis is invariant under affine transformations, 
we see that irrespective of the equations (24) R has the same moment of inertia 
as Q with respect to every axis, if Q is covered with homogeneous mass of same 
total mass as R, i.e., we have proved 
5.4. The tangential quadric at a point P of R is identical with Legendre’s ellipsoid 
of inertia of R, if the polar of mass of P is taken as plane at infinity, i.e., if P is 
the center of mass of R.° 

If we denote as the generalized ellipsoid of inertia of R in P a quadrie which 
is covariant under collineations and coincides with Legendre’s ellipsoid of 
inertia in case P is the center of mass of R, the tangential quadric will be identical 
with the generalized ellipsoid of inertia, i.e., the P.C. with respect to R is in the 
neighborhood of P approximately given by the P.C. with respect to the general- 
ized ellipsoid of inertia in P. 

Let us again make use of the special non-homogeneous coérdinate system, 
in which Q is the unit sphere about P, i.e., in which (24) holds. It then follows 
that 


(25) n+2) [alt + ++ ra)day +++ dt, = m ff wis +> da. 
R R 


From this equation it follows that the inequality 


h+--- +8 <a 
cannot hold for all points of R. Hence: 
5.5.1. If R is enlarged in the ratio (n + 2)':n', it is not contained in Legendre’s 
ellipsoid of inertia of R. 

5.5.1 may be expressed in the projectively invariant form: 
5.5.2. Let Q be the tangential quadric at a point P with polar of mass p. Then 
there is at least one point S of R, such that the cross ratio (P, p/S, s) = n/(n + 2), 
where s is the polar of S with respect to Q. 

5.5.2 is remarkable, because it gives an inequality for the tangential quadric 
in a P.C. generated by a mass distribution, which seems not to be satisfied in 
every positive, symmetric P.C. 


6. Derimition. The P.C. corresponding to a homogeneous mass-distribu- 
tion (u = const.) may be denoted as the principal P.C. with respect to R. 
6.1.1. Let P be a point of R and p its polar of mass in the principal P.C. Let 


6 Legendre’s ellipsoid of inertia of a body is defined as the homogeneous ellipsoid having 
the same moment of inertia with respect to every axis as the body. (Cf. Blaschke, Ber. 
Verh. sachs. Akad. d. Wiss., vol. 70 (1918), pp. 72-75.) Legendre’s ellipsoid is essentially 
the polar reciprocal of Binet’s fundamental ellipsoid of inertia, which is characterized by 
the property that the moment of inertia with respect to any plane through its center is 
inversely proportional to the square of a radius vector perpendicular to it. (Cf., e.g., 
A. G. Webster, Dynamics, p. 231.) 
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f be any (n — 2)-flat contained in p. If then q; and qy denote the planes of support 
of R passing through f and q; the plane through P and f, then 


sw 


1 
—n & (pq;/aa) & . 


(If R is a quadric, the cross ratio always has the value —1.) 

Proof. 6.1.1 is the projective formulation of the following affine theorem 
given by Minkowski: the distance of the center of mass of a homogeneous convex 
body from any plane of support s lies between B/(n + 1) and nB/(n + 1), where 
B is the distance of s from the plane of support parallel to s. 

The following dual theorem is proved similarly. 

6.1.2. Given a point P of R and its polar p under the principal P.C. Let F 
denote any line through P and let Qr, Qr , Qr be respectively its points of inter- 
section with p and with the boundary of R. Then 


1 
—n S (PQr/QrQr) S or 

If uw is constant and if a non-homogeneous coérdinate system is chosen, in 
which Legendre’s ellipsoid of inertia is the unit sphere about P, according to (25) 


(26) m+2 | (xi +--+ + 22)dx,--- dz, = n [an --- dz, = nV, 
R R 


V being the volume of R. Let us introduce polar coérdinates by x; = ré;, 
where (£, --- , &:) is a point of the unit sphere 2. If dw is the element of sur- 
face of 2, (26) may be written 


(27) [ora = [orae, 
2 2 


where r = p(&, --- , &) is the equation of the boundary of R. From (27) it 
follows that p can neither be > 1 nor < 1 for all points of the boundary of R. 
Thus the boundary of R has certainly points in common with the boundary of 
the unit sphere. This proves the following projectively invariant property 
of the principal P.C. 
6.2. The boundary of R is intersected by the boundary of every tangential quadric 
in the principal P.C. 

Compare with 5.5.2 and 4.4. 

Some other consequence of (26) may be pointed out. Let again the unit 
sphere about P be Legendre’s ellipsoid of inertia of R. Let = be the sphere of 
volume V about P. Then 


(28) I r'dx, +++ dz, = [ dx, +++ dxn, 
R z 


7 Cf. Bonnesen and Fenchel, Konvexe Kérper, p. 52. 
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as r° is less in any point of = not in R than in any point of R not in >. If ps 
is the radius of 2, then 





2 n 2 > 
dz, +++ dz, = 2 Ly. 
[: Zi dx 3 J 


Thus according to (26) and (28) 
V2 p:V, 


i.€., py S 1; this means that the sphere of volume V has no greater radius than 
the unit sphere, and consequently the volume of the unit sphere is not less than 
that of R. As Legendre’s ellipsoid is covariant under affine transformations, 
we have thus proved the following theorem of Blaschke: 

6.3. The volume of Legendre’s ellipsoid of inertia of R is not less than that of R.° 

Every theorem on moments of inertia of homogeneous convex regions may be 
interpreted as a statement on the tangential quadrics in a principal P.C. Cf., 
e.g., in this connection the author’s paper Moments of inertia of convex regions, 
this Journal, vol. 2, pp. 447-452. 

Remark. Given an algebraic surface of degree 2m consisting of m closed sur- 
faces containing one another. Let = be the (convex) most interior of these 
surfaces. Then there is a 1-1 correspondence of the points of = and the planes 
outside 2, if the plane corresponding to a point is its ordinary plane polar with 
respect to the algebraic surface. This correspondence is a positive, symmetric 
P.C, 


UNIVERSITY OF KENTUCKY. 


8 Cf. Blaschke, loc. cit. footnote 6, where this theorem appears as a generalization of 
Sylvester’s four-point-problem. The two volumes in 6.3 can only be equal, if F is an ellip- 
soid, as the equality sign in (28) holds only if R is a sphere. The convexity of FR is not 
actually used in the proof. 
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f be any (n — 2)-flat contained in p. If then q; and qj denote the planes of support 
of R passing through f and q; the plane through P and f, then 


, ” 1 
—n S (pq;/ara7) & ae 


(If R is a quadric, the cross ratio always has the value —1.) 

Proof. 6.1.1 is the projective formulation of the following affine theorem 
given by Minkowski: the distance of the center of mass of a homogeneous convex 
body from any plane of support s lies between B/(n + 1) and nB/(n + 1), where 
B is the distance of s from the plane of support parallel to s." 

The following dual theorem is proved similarly. 

6.1.2. Given a point P of R and its polar p under the principal P.C. Let F 
denote any line through P and let Qr, Qr , Q be respectively its points of inter- 
section with p and with the boundary of R. Then 


, 1 

—n S (PQr/QrQr) S as 
If uw is constant and if a non-homogeneous coérdinate system is chosen, in 
which Legendre’s ellipsoid of inertia is the unit sphere about P, according to (25) 


(26) (n+ 2» | (ai +--+ + aa)dx --+ dz, = nf dz, +++ dz, = nV, 
R R 


V being the volume of R. Let us introduce polar coérdinates by x; = ré;, 
where (£, --- , &:) is a point of the unit sphere Q. If dw is the element of sur- 
face of 2, (26) may be written 


(27) [ ody = [ ode, 
2 Q 


where r = p(i, --- , &) is the equation of the boundary of R. From (27) it 
follows that p can neither be > 1 nor < 1 for all points of the boundary of R. 
Thus the boundary of R has certainly points in common with the boundary of 
the unit sphere. This proves the following projectively invariant property 
of the principal P.C. 
6.2. The boundary of R is intersected by the boundary of every tangential quadric 
in the principal P.C. 

Compare with 5.5.2 and 4.4. 

Some other consequence of (26) may be pointed out. Let again the unit 
sphere about P be Legendre’s ellipsoid of inertia of R. Let = be the sphere of 
volume V about P. Then 


s 
~ 


(28) day -+- dr, = [ rdax, +++ dxn, 
R 


7 Cf. Bonnesen and Fenchel, Konveze Kérper, p. 52. 
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as r’ is less in any point of = not in R than in any point of R not in Y. If py 
is the radius of =, then 





[rex --- dz, = —— on ¥. 
z 
Thus according to (26) and (28) 

V =os-V, 


i.€., py S 1; this means that the sphere of volume V has no greater radius than 
the unit sphere, and consequently the volume of the unit sphere is not less than 
that of R. As Legendre’s ellipsoid is covariant under affine transformations, 
we have thus proved the following theorem of Blaschke: 

6.3. The volume of Legendre’s ellipsoid of inertia of R is not less than that of R.° 

Every theorem on moments of inertia of homogeneous convex regions may be 
interpreted as a statement on the tangential quadrics in a principal P.C. Cf., 
e.g., in this connection the author’s paper Moments of inertia of convex regions, 
this Journal, vol. 2, pp. 447-452. 

Remark. Given an algebraic surface of degree 2m consisting of m closed sur- 
faces containing one another. Let = be the (convex) most interior of these 
surfaces. Then there is a 1-1 correspondence of the points of = and the planes 
outside 2, if the plane corresponding to a point is its ordinary plane polar with 
respect to the algebraic surface. This correspondence is a positive, symmetric 
P.C, 


UNIVERSITY OF KENTUCKY. 


8 Cf. Blaschke, loc. cit. footnote 6, where this theorem appears as a generalization of 
Sylvester’s four-point-problem. The two volumes in 6.3 can only be equal, if RF is an ellip- 
soid, as the equality sign in (28) holds only if R isa sphere. The convexity of FR is not 
actually used in the proof. 











INTERIOR TRANSFORMATIONS ON COMPACT SETS 
By G. T. WHysurN 


i. As originally defined by Stoilow,’ a single-valued continuous transforma- 
tion 7(A) = B is called an interior transformation provided (1) the image of 
every open set in A is open in B and (2) the inverse set 7~'(b) of each point b 
in B is totally disconnected. Most later writers have omitted condition (2) 
and have spoken of an interior transformation as one satisfying (1) alone. 
While this latter point of view will be adhered to in this paper, it turns out that 
in most cases the hypotheses in the theorems are such as to make both (1) and 
(2) satisfied. In order to save words we shall call a continuous transformation 
satisfying (2) a light transformation. Hence a light interior transformation in our 
terminology is the same as an interior transformation as defined by Stoilow. 

Our object will be to develop fundamental properties of interior transforma- 
tions as applied to compact metric sets with a minimum of restriction on the 
image set. All our sets are assumed to lie in a metric space. We begin with 
some basic lemmas and theorems. 

(1.1) Lemma. Let T(R) = S be single-valued and let Q be a subset of R such 
that 


(i) Q = T'T(Q). 
Then for any subset X of R we have 
(ii) T(Q:X) = T(Q)-T(X). 


Proof. Obviously T7(Q-X) C T(Q)-T(X). To prove the reverse inclusion, 
let z € T(Q)-T(X). Then T(x) CQ, T"(x)-X ¥ 0. Thus if ye T(z)-X, 
we have y e€Q-X-T™'(z), a relation which gives T(y) = ze T(Q-X). 

(1.2) Lemma. Jf T(R) = S is an interior transformation on R, and Q is any 
subset of R satisfying (i), then T is an interior transformation on Q. 

Proof. Let X, be any open subset of Q. There exists an open subset X of 
R such that X, = X-Q. By hypothesis T(X) = U is open in S, and by (1.1) 


T(X,) = TQ-X) = TQ)-T(X) = TQ)-U. 


Thus 7'(X,) is open in 7(Q) and the lemma follows. 
(1.3) Lemma. If A is compact and T(A) = B is continuous, then for any set 
RCA 
(i) T(R) = T(R), 
(ii) T(R) — T(R) CT(R — R). 
Received March 19, 1937. 


1 See Stoilow, Annales Scientifiques de |’Ecole Normale Supérieure, vol. 63 (1928), pp. 
347-382 and Annales de |’ Institut Henri Poincaré, vol. 2 (1932), pp. 233-266. 
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Thus if T is an interior transformation and R is open in A, 
(ili) F(T(R)] C T[F(R)], 


where in general F(X) denotes the boundary of X. 

(1.31) Corottary. The Menger-Urysohn order of a point in A is never in- 
creased when A undergoes an interior transformation. The properties of being 
(i) a curve of order S n, (ii) a regular curve, (iii) a rational curve, are invariant 
under interior transformations. 

(1.4) THrorem. Let A and B be compact and let T(A) = B be interior. Then 
if R is any quasi-connected open set in B, every quasi-component @ Q of T™'(R) 
maps onto all of R under T. 

Proof. Suppose, on the contrary, that for some p « R, T-'(p)-Q = 0. Then 
since T”'(p) is compact and is C 7”"(R), it follows by an application of the 
Borel theorem that there exists an open set U such that 


(i) QCcCUCT *"(R), F(U)-T"(R) = 0, U-T"(p) = 0. 
Now T(U) = V is open, and we have 
(ii) V>T7(Q) CR, VCR-p, 


whence, since R is quasi-connected, F(V)-R # 0. But by Lemma (1.3), 
F(V) C T[F(U)] and by (i), T[F(U)]-R = 0. Thus our theorem is proved. 

(1.41) Corotuary. If A is locally connected, if Bo is any closed set in B and 
R is any component of B — By , then T~*(R) has just a finite number of components 
and each one of these components maps onto all of R under T. 

(1.42) Corottary. If A is locally connected and B is connected and cyclic 
(i.e., without cut points), then for each x « B, A — T~*(x) has just a finite number 
of components and each of these components maps onto B — x under T. 

(1.5) THrorem. Let T(A) = B be an interior transformation, where A is 
compact, and let C be any continuum in B. Then for each component K of T~*(C) 
= Q, we have T(K) = C. 

For by (1.2), the transformation 7(Q) = C is interior. Thus, since C is 
connected and open in C, it follows by (1.4) that each component of Q maps onto 
all of C under T. 

(1.6) Let T(A) = T2T,(A) = B, where T;(A) = A’ and T:(A’) = B are 
continuous. If T is interior, so also is T? . 

For let R be any open set in A’ and let Q = 77'(R). Then since 7; is con- 
tinuous, Q is open in A; and since 7’ is interior, T(Q) isopenin B. But 7(Q) = 
T:T,(Q) = T.(R). Accordingly, 7:(R) is open in B. 

(1.61) Corottary. If A is compact, any interior transformation T(A) = B 
can be factored into the form T(A) = T2T,(A), where T; is monotone’ and 1: is 
interior and light. 


* A transformation 7(A) = B is monotone provided that for each b « B, T-(b) is con- 
nected. 
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For T can be factored into the form 7.7), where T; is monotone and 7% is 
light by a known theorem;’ and by (1.6), 72 is necessarily interior. 


2. Mapping of the Betti groups. ilenberg* has raised the question whether 
under an interior transformation T(X) = Y, where X is compact, the Betti 
groups of X map homomorphically onto the corresponding groups of Y. With 
respect to the integral coefficient domain G, this question is easily answered 
in the negative by the transformation w = 2° of the circle |z| = 1 into the 
circle | w | = 1, since in this case clearly’ By(X) maps onto 2B4(Y). 

However, if we consider the Betti groups relative to the rational coefficient 
field R, this question is not nearly so easy to answer. In the above case, clearly 
B}(X) maps homomorphically onto B,(Y), and indeed it will be proved in the 
next section that this is always true for any linear graph X undergoing an interior 
transformation.® 

The following example is of interest in this connection as well as in connec- 
tion with a theorem to be proved below in §4. 

(2.1) Exampie. There exists a compact continuum K with a vanishing one- 
dimensional integral Betti group and an interior transformation f(K) = J of 
K onto a circle J. 

To exhibit this, we employ a continuum K which has been constructed by 
Vietoris’ for a different purpose. This continuum is constructed by first taking 
a Cantor ternary set on both the upper and lower bases of the unit square with 
vertices (0, 0), (1, 0), (1, 1), (0, 1) and joining corresponding points in these 
two sets (i.e., points with the same abscissa) by the altitude of the square they 
determine. This gives a set C which may be considered a “Cantor set of unit 
intervals”. K is then constructed from C as follows. We identify the lower 
endpoints of the left half of these intervals with the upper endpoints of the 
right half in the same sense from left to right; then divide the remaining lower 
endpoints in half and identify the left half with the upper right half of the re- 
maining upper endpoints in the same sense; next divide the remaining lower 
endpoints in half and identify the lower left half with the upper right half of the 
remaining upper endpoints, and so on indefinitely. Finally identify the points 
(1, 0) and (0, 1). 

Vietoris (loc. cit.) has shown that the one-dimensional integral Betti group 
of this continuum K reduces to the null element. 

We now proceed to define an interior transformation f which will map K 


3 See Eilenberg, Fundamenta Mathematicae, vol. 22 (1934), p. 292 and G. T. Whyburn, 
American Journal of Mathematics, vol. 56 (1934), p. 297. 

4 See Fundamenta Mathematicae, vol. 24 (1935), p. 175. 

5 We employ here the notation (see Alexandroff-Hopf, Topologie) B}(K) for the r-dimen- 
sional Betti group of a complex (or set) K relative to a coefficient domain J. 

6 Results obtained recently indicate that this also holds for any compact set X with 
p(X) finite. This is contrary to a statement made by the author in an abstract in the 
Bulletin of the American Mathematical Society, vol. 43 (1937), p. 183. 

7 See Mathematische Annalen, vol. 97 (1927), p. 459. 
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onto the unit circle J. Let fi(C) = K be the (continuous) transformation 
representing the above “identifying” construction of K from the set C. Let 
p « K and let (2, y) ef;'(p), where x and y are Cartesian coérdinates. We define 


f(p) = @’, y’), 


where 


x’ = cos ry x’ = cos r(1 + y) 
for x < 3, and ‘ for z > }. 
’ = sin r(1 + y) 


, 


y’ = sin ry y 


Clearly this transformation is interior and light and it maps K onto the circle 
x” 4 y” = 1. 


3. Linear graphs. 

(3.1) THrorem. Let A be a linear graph and let f(A) = B be interior. Then 
B is a linear graph and there exist subdivisions of A and B respectively into finite 
complexes K, and Ky, such that f maps each edge of K, topologically into an edge 
in Ky. Thus f(Ka) = Ky is a simplicial transformation. Furthermore, if J 
is any simple closed curve in B, there exists a simple closed curve C in A such that 
f(C) = J and, on C, f is topologically equivalent® to the transformation w = 2* 
on |z| = 1 (k an integer). Consequently B,(K.) maps homomorphically onto 
B(Ko) under f. 

Proof. Since A has only a finite number of points of order > 2 and has no 
point of increasing order, and since, by (1.31), the order of no point can be 
increased under f, it follows that B has these same properties. Accordingly, 
B is a linear graph. 

Furthermore, for each b « B, f(b) must be a finite set of points. For other- 
wise there would exist infinitely many components of A — f '(b), and clearly 
this is impossible, since by (1.41) the inverse of each one of the finite number 
of components of B — b is a finite set of components of A — 7~*(b). 

Now let E, and E; respectively denote the sets of points in A and B of order 
~ 2. Let DD = E, + f(£.) + one point from each component of B — [Ey + 
f(E.)]| which has at most one limit point in E, + f(E.); let D. = f (De). The 
sets D, and D, subdivide A and B into complexes K, and K,, respectively. 
Let zy be any edge of K, and let f(z) = 2’, f(y) = y’, f(zy) = z’y’. Then 
x’ ~ y’, since otherwise f(xy — x — y) would be a free are in B having just one 
endpoint, which is impossible by the choice of D,. Furthermore, for any 
pex'y’ — x’ — y',f '(p)-xy reduces to just one point. For if not, there would 
exist an open arc pip. in zy — zy-f '(p) with p: + pe Cf “(p); this would give 
f(ip2) DP x’ or f(fip2) DP y’; and both are impossible, since f“(z’)-zy = 2, 


8’ Two transformations 7(A) = B and W(A’) = B’ are said to be topologically equivalent 
provided there exist topological transformations H,(A) = A’ and H;(B’) = B such that 
T(z) = H,WH,(z) for every x<¢ A. See my paper Completely alternating transformations, 
Fundamenta Mathematicae. vol. 27 (1936), pp. 140-146. 
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f'(y’)-ty = y’. Accordingly, f(zy) = 2’y’ is topological and f(K.) = Ky is 
simplicial. 

Now let J be any simple closed curve in B, let the vertices of K, on J be 
ordered cyclically po, Pi, «*- , Pn» Po and let qo ef (po). There exists an edge 
gm of K, which is contained in f~'(pop;) and such that q €f'(p:). Similarly 
there is an edge qig2 with q:q2 Cf (pipe) and q2 ef (pe), andsoontop,. There 
is an edge quqo With qngo Cf '(pnpo) and qi) ef (po). If qo = q, we can bring 
our selection to a close. If not, we choose an edge qoqi inf ‘(popi). Again if 
qi = m1, We may stop. Otherwise we choose an edge q:q2 in f ‘(pi pz), and so on. 
Continuing this process, since f'(b) is a finite set for each b ¢ B, after a finite 
number of steps we must eventually reach, for the first time, a point qj (i < n) 
such that qi = qj (m <j). Then clearly the edges 
mt+1 m+1 m+1 m+1 m+1 m+1 m+1 _m+2 
0 1 » M1 2 


Qo 4 G2 5 °** 5 QUn-19n 5» In vet, diag, data, 


QitiQit2, °°", qn 

fit together to form a simple closed curve C which maps onto J exactly 7 — m 
times; and since g: 9341 maps topologically onto p,p,4:, it follows at once that 
if we set j — m = k, the transformation f(C) = J is topologically equivalent 
to the transformation w = 2‘ on the circle | z | = 1 of the complex z-plane. 

Since, as shown above, f can be considered as a simplicial transformation of 
K, onto K,, it follows that f generates a homomorphism, which we shail also 
call f, of B(Ka) into Bk(K»). To see that actually f[B(K.)] = Br(K»), let 
us take any rational l-cycle z in K,. As is well known, z can be expressed as a 
linear form 


Z = 2 + Aeze + es + An2n (a; rational), 


where z; is a simply oriented simple closed polygon. By the above, we can 
find a simple closed curve C; in A such that f(c;) = k:z; (¢ S n, k; an integer), 
where ¢; is an oriented fundamental cycle on C;. Accordingly if we set 


Qn 
Kn 


then y is a rational 1-cycle in K, and f(y) = z. Thus the homology class of 
7 in B)(K.) maps onto the homology class of z in Bz(K») and our result follows. 

(3.11) Corotuary. p'(Ks) < p'(Ka) (where p'(X) denotes the r-dimensional 
Betti number of X). 

(3.2) If A is a simple closed curve and T(A) = B is interior, then B is either 
a simple closed curve or a simple arc. If B is a simple closed curve, there exists 
an integer k such that T is topologically equivalent to the transformation w = 2* 
on the circle|z|=1. If Bis anarce, there exists an integer k such that T is equiva- 
lent to the transformation f(1, 6) = sin k6/2 of the circle p = 1 into the interval 
(—1, 1). 

Proof. It follows from (1.31) that every point of B is of order S 2. Ac- 
cordingly, B is either a simple closed curve or a simple arc. 


Cny 


a a 
yepatpat::: + 














INTERIOR TRANSFORMATIONS ON COMPACT SETS 375 


If B is a simple closed curve, it follows from (3.1) by taking J = B that T is 
equivalent to w = 2‘ on | z| = 1 for some integer k. 

If B is a simple are zy, referring back to the proof of (3.1), we see that D, = 
x + yand hence D, = T(x) + T(y). Thus if we order the points of D, 
cyclically on A: pi, po, -** , Pe, Pr, Where p, e T'(zx), then by (3.1), pipe maps 
topologically onto ry, pops maps topologically onto yx, psp, onto zy and so on to 
Px Pi, Which maps onto yx. (Note that k must be even.) Hence T' is equivalent 
to the transformation f(1, 6) = sin 3k@ of p = 1 into (—1, 1). 

Remark. In each case the integer k is exactly the maximum power among 
the sets 7” '(b) for all b « B. If k is odd, B is necessarily a simple closed curve, 
T is completely alternating’ and all the sets T~*(b) are of power k. If k is even, 
B may be either a simple closed curve or an are zy; and in the latter case, 7” '(z) 
and 7 *(y) are each of power 3k, while for every other point b of B, 7~*(b) is of 
power k. 

(3.3) If A is a simple arc pq and T(A) = B is interior, B is a simple arc 
and there exists an integer k such that T is topologically equivalent to the transforma- 
tion f(x) = cos krx of the interval (0, 1) into the interval (—1, 1). 

Proof. Since, by (1.1), B must have at least two points of order 1 but no 
point of order > 2, it follows that B is a simple are uv. 

Referring again to the proof of (3.1), we see that in this case D, = u + vD 
T(p) + T(q) and D, = T(u) + T*(v). Hence if we order the points of D, 
on A from p to g:p = po, Pi, *** » Pe = G and suppose the notation chosen so 
that T(p) = u, then by (3.1), pop: maps topologically onto wv, pip, maps topo- 
logically onto vu, pops onto uv, and so on to px_1pe, Which may map either onto 
uv or vu. Hence it is seen at once that T is equivalent to the transformation 
f(x) = cos krz of (0, 1) into (—1, 1). 

By Corollary (3.11), the Betti numbers of a graph cannot be increased under 
an interior transformation. This conclusion can be strengthened somewhat 
for graphs without endpoints, namely, in that we can obtain the same conclusion 
for any continuous transformation 7(A) = B which increases the (Menger- 
Urysohn) order of no point of A. Such a transformation will be called non- 
order-increasing. 

(3.4) Lemma. If A is aconnected linear graph without endpoints and T(A) = 
B is a non-order-increasing transformation, then B is a linear graph and p'(B) < 
p (A). 

Proof. Since A has only a finite number of points of order # 2 and has no 
point of increasing order, it follows that B must have these same properties 
and hence must be a graph. 

Let X be a finite set of points in A such that X contains all points of A of 
order # 2 and T(X) = Y contains all points of B of order # 2 and such that 
each component of A — X or B — Y has two distinct limit points in X or Y 
respectively. Let a° and #° be the number of points in X and Y respectively. 
Let a’ and @' be the number of components in A — X and B — Y respectively. 


® Loe. cit. 
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Then by the Euler-Poincaré formula we have 
(i) p(A) = a — a +1, p(B) = 6-6 +1. 
Also, if we set 
a? 
X=L ni, Y=, 

we have 

a? go 
(ii) Dd o(z:) = 2a’, 2 o(ys) = 28", 

1 
where in general o(x) denotes the Menger-Urysohn order of the point z. Now 
for each 7 S 8°, let 


X-T (ys) = ti + ai + +++ + af. 


Then 
ge B° ky : a? 
(iii) Dk =e? and YO Ye = X= DH. 
I i=1 j=1 1 
Since by hypothesis o(z}) = o(y;) and o(z}) = 2, we have 
ky : ky : 
(iv) > o(ai) = ofys) + > o(xi) = o(y:) + 2(k; — 1), 
7=1 hs 
whence 
89 kj 89 30 
(v) > LD (xi) = > o(yi) + 2 > ks — 26°. 
i=l j= i= i= 


Thus from (ii), (iii) and (v) we get 
2a' = 28' + 2a’ — 2° 
or 
(a' — B') — (a — 8’) 20, 
whence, from (i), 
p'(A) — p(B) = (a' — B) — (oe — B) 20 
or 
p'(A) = p(B). 


(3.5) Let T(A) = B be non-order-increasing. Let A be a connected graph 
having a set X of q endpoints and let T(X) contain r points. Then B is a graph 
and p'(B) S p'(A) + (3)(@ — 7). 


For we can replace (iv) by 


(iv Dole!) = oy) + Y olat) = oy) + 2k -V -(-1) HSH), 
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where q; is the number of points of order 1 of A in T ‘(y:) and the notation is 
chosen so that o(z}) = min [o(z})] and q; > Ofori < r. Summing, we get 


po ky p° 8° 7 
(v)’ > > o(zi) = ® o(y:) + 22>) k; - 28° — > qi + fr. 


Whence, as before, 
2a' > 26' + 2a’ — 28° — (q — 1), 
(a' — 6’) — @’ — #) 2-4-2), 
p'(A) — p(B) = -3q — 1»), 
p(B) S p'(A) + 3q — 7). 

(3.51) Corotuary. If 

(i) gq = 0, i.e., there are no endpoints, 

(ii) gq =r,2.e., T is (1-1) on the set X of endpoints of A, 

(iti) ¢g — r S 1, #.e., at most one pair of endpoints in A map into one point in B, or 
(iv) g S 2, 2.e., there are at most two endpoints in A, 

then p'(B) < p'(A). 

(3.6) THrorem. Jf A is a one-dimensional compact locally connected con- 
tinuum and T(A) = B is an interior transformation, then p'(B) < p'(A). 

Proof. Clearly we may suppose p'(A) finite. If, contrary to our theorem, 
p'(B) > p'(A), we can choose an A-set H in B without endpoints and such that 
p'(H) > p'(A), for we need only take a finite number of true cyclic elements of 
B so that their sum S has Betti number p'(S) > p'(A) and let H be the smallest 
A-set in B containing S. 

Let K be a component of T77'(H). Then since T(K) = H is an interior trans- 
formation [by (1.5) above, T7(K) = H; and T is interior on K, since there are 
only a finite number of components of T~'(H)], and since H has no endpoints, 
it follows that K has no endpoints because 7 is non-order-increasing on K. 
Whence, by (3.4), p'\(H) < p'(K). But p'(K) S p'(A), and this gives p'\(H) < 
p'(K) Ss p'(A), contrary to our supposition. Accordingly p'(B) < p'(A). 

(3.61) Corotiary. If A is a dendrite, so also is B. 


4. Light interior transformations. It will be recalled from the introduction 
that a transformation 7(A) = B is light provided that for each b « B, T~'(b) is 
totally disconnected. We begin this section with a theorem for arbitrary com- 
pact sets analogous to a result of Stoilow’s” for.the case of a transformation of 
one plane region into another. 

(4.1) Turorem. Let T(A) = B be a light interior transformation, where A is 
compact. Then if pq is any simple arc in B and py is any point in T™(p), there 
exists a simple arc pogo in A such that T(poqo) = pq and T is topological on po qo. 


1° See Stoilow, loc. cit. The author learned recently that a theorem essentially the same 
as (4.1) has been discovered independently by Zippin and Montgomery. See an article by 
Montgomery in a forthcoming issue of the Transactions of the American Mathematical 
Society. 
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Our proof for this theorem will make use of the following definition and lemma. 
DerFIniTIon. If o:p = 2%, %1,°°*,2n = @ (xi precedes 2,4; in the order 
p, Q) is a subdivision of pq, a continuum K in 7~'(pq) will be said to proceed 
directly from T~'(p) to T~‘(q) relative to the subdivision ¢ provided T(K) = pq 


, , ys . ° ol 
and K = pp K;, where K; is a continuum in JT” (2;:2;). By the norm of 
» 1 


such a continuum K will be meant max 6(K;,). 

Lemma. (Given any subdivision a’ of pq and any « > 0, there exists a subdivision 
o of pq containing o’ and a continuum K of norm < «in T (pq) which contains 
po and which proceeds directly from T~'(p) to T~*(q) relative to o. 

Proof of Lemma. Let S denote the set of all points g’ on pq for which such 
a subdivision exists on pq’ containing all points of o’ on pq’. Let x be any 
point of pq. 

If x eS, let us take the subdivision ¢ = ¢, on px and the corresponding con- 
tinuum K = K,. Let zeK-T (x). There exists an eneighborhood U of z 
such that F(U’)-7"'(«) = 0. There exists a point v on z, such that if w is any 
point whatever of xv, F(U)-T™'(w) = 0 and w does not belong to o’ unless 
w = x. It follows that Q = T‘(zv)-U is both open and closed in 7” '(zv). 
Accordingly 7(Q) = rv is an interior transformation. Let K,4, be the com- 


ponent of Q containing z, let o, be the subdivision p = 2%, 21, °** ,2n,2%nu1 =v 
n+l 


of pu and let K, = > Ki. Then clearly K, is a continuum of norm < ¢ which 
1 


proceeds directly from 7T”'(p) to T~‘(v) relative to o, , so that v belongs to S. 
Thus S is open in pq. 

Now if x «pq — S, we can choose zx to be the first point of pg — S on pq. 
Hence pr — x CS. If we take a sequence y;, y2, --: on px so that y; — 2, 
and choose points z; ¢ K,,-T"'(y;), where Ky, is a continuum of norm < e in 
T ‘(py:) proceeding directly from 7'(p) to T~'(y,) relative to a subdivision 
oy, of py; which contains all points of o’ on py; , the sequence 2 , 22, --- will 
have a limit point z on 7~*(x), and we may suppose z; —> z. Let us choose the 
e-neighborhood U of x exactly as before. There exists an 7 such that for any 
wey;2, T '(w)-F(U) = Oand such that no point of y;z, except possibly z, belongs 
to o’. Hence if Q = 7 *(y;z)-U, Q is both open and closed in T~*(y;x) and 
7(Q) = y:z is interior. Now if o,,; is denoted by p = 2%, %1,°**, 2, = yi and 


K,, = > K,, , let oz be the subdivision p = 2, %1, ++, 2n, Lnur = 2 OF pa, let 
1 
. n+l 


K+: be the component of Q containing z; and K, = > K;. Then clearly K, is 
1 


a continuum of norm < ¢ which proceeds directly from 7~'(p) to T~’(z) relative 
to ¢, , and o, contains all points of o’ on px. But this gives z e S, contrary to 
our supposition. Accordingly S = pq, and the lemma is proved. 

Proof of the Theorem. There exists a subdivision 01: p = 2%, °° ,2n, = 


m1 
of pg and a continuum K’ = >> Kj of norm < 1 which contains pp and proceeds 


1 
directly from 7” '(p) to T~*(q) relative to o; . 
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In general, for each k there exists a subdivision 


k k 
Or. P = Xo, °** , Tiny = Y; 


nk 
. ° . rk rk 4 
which contains o;_;, and a continuum K* = >> K¢ of norm < 1/k which con- 
1 


tains po and proceeds directly from T~'(p) to T~'(q) relative to ox . 

Since the sequence K', K*, --- contains a convergent subsequence, we may 
suppose, without loss of generality, that the whole sequence converges to a 
limit which we shall call E. Since each K‘ is a continuum with 7(K') = pq, 
it follows that E is a continuum and T(E) = pq. To prove that £ is an are 
and T(E) = pq is topological, it suffices to prove that for each y € pq, T'(y)-E 
reduces to a single point. 

Now since for each k, i, j (i < j, i, j S nx) there exists a continuum Ki; = 

I -~ 

> Ki, in K* which contains K*-7~(z{ 5), it follows that for each k, 7, j there 
m=i+1 ~, 
exists a continuum Ej; = lim Ki", in E which contains E-T7'(2ir'). (Note: 


IM JNimim 

™m 

. ° : k &k ob k 
im and jm are integers such that 2:5" = aj, 235" = aj.) 


Now for any y « pg and any k let us choose i and j so that (i) y is interior 
to the are zjz5 if pp ~ y ¥ q, (ii) y= 21 = 2 = pify = p, y= 25 = qify = 4, 
and (iii) 0 <j — i < 2. Then since in all cases E-T '(y) is interior to Ej; 
(rel. E), it follows that E-T™'(y) = lim Ei;. Accordingly, since each E%; is a 

(kv) 


continuum, E-7~'(y) is a continuum. Therefore E-7™'(y) reduces to a single 
point, since T”’(y) is totally disconnected, and our theorem is proved. 

Remarks. It is of interest to note that the hypothesis of the theorem just 
proved is satisfied even in cases such as that exhibited in the example given in §2. 
Also it results at once from this theorem that the property of containing no are 
is invariant for compact sets under light interior transformations. It may be 
noted further that, whereas the initial point po of the are pogo can be chosen 
arbitrarily in 7” '(p), the terminal point q is not subject to choice but may 
easily depend on the choice of po. 

The methods of proof used in (3.1) together with the theorem just proved 
yield at once 

(4.2) THrorem. Under the conditions of (4.1), if J is any simple closed curve 
in B, then for any integer n > 0 either there exists a simple closed curve C in A and 
an integer k S n such that T(C) = J and, onC, T is topologically equivalent to the 
transformation w = z* on |z| = 1, or there exists a simple arc X in A such that 
T(X) = J and, on X, T is equivalent to the transformation w = e”' on the interval 
—(n+1)r Sy S (n+ 1)x. [If there exists no such simple closed curve C for 
any k, then there exists an open curve L in A such that T(L) = J and, on L, T is 
equivalent to the transformation w = e”' on — © <y < @. 

Note. Since on a linear graph A any interior transformation is necessarily 
light, it is clear that in so far as it concerns simple closed curves in B, (3.1) is a 
consequence of (4.2). Also since the open curve L (when it exists) necessarily 
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contains infinitely many disjoint intervals all of diameter greater than some 
d > 0, we have 

(4.21) Corottary. If T(A) = B is interior and light, where A is hereditarily 
locally connected, then for any simple closed curve J in B there exists a simple closed 
curve C in A and an integer k such that T(C) = J and, on C, T is topologically 
equivalent to the transformation w = 2 on |z| = 1. 

(4.3) THreorem. Let T(A) = B be interior, where A is compact. For any 
simple arc pq in A there exists a continuum E in A such that T(E) = pq and, on E, 
T is monotone. 

Proof. Let us factor T into the form 7:7, where 7; is monotone and 7; is 
light. Let A’ = 7,(A). Then, by (1.6), 7:(A’) = B is interior and light. 
Hence by (4.1) there exists an are pogo in A’ such that 7'2(poqo) = pq is topologi- 
eal. Let E = 7Ty'(poqo). Then E is a continuum since 7; is monotone, and 
since 


T(E) = T2T\(E) = T2T:Ty'(poq) = T2(poqo) = pq 


and 72 is topological on 7;(E) = pogo, it follows that T(#) = pq is monotone. 

Note. Theorem (4.1) also follows directly from this theorem. 

(4.4) THrorem. Let T(A) = B be interior and light, where A its compact. 
If K is any locally connected continuum in B and C is any component of T~*(K), 
the transformation T(C) = K is intertor. 

Proof. Let E be any open subset of C. There exists an open subset D of 
T'(K) such that D-C = E. Let y be any point of T(E) and let z « E-T “(y). 
Since dim 7~'(y) = 0, there exists a set U open in D which contains x and is 
contained in D and such that F(U)-T"'(y) = 0. Let T[F(U)] = Ko, let R 
be the component of K — Ko containing y and let Q be the quasi-component 
of T-'(R) containing x. By (1.4), 7(Q) = R; and since T"'(R) C T'(K) — 
T (Ko) C T'(K) — F(U) and Q DxeU, we have Q CU CD. Whence, 
Q C D-T'(K); and since Q is quasi-connected, this gives Q C C, so that Q C 
D-C = E. Accordingly T(Z) > T(Q) > R, so that T(E) is open in K. 


5. Transformations on plane sets. For light interior transformations applied 
to plane locally connected continua we prove the following result concerning 
the invariance of iocal connectivity under the inverse transformation. 

(5.1) Turorem. Let A be a plane, compact, locally connected continuum 
and let T(A) = B be a light interior transformation. Then if K is a locally con- 
nected subcontinuum of B, T~'(K) is locally connected. 

Proof. If this is not so, there exists" a point p e T”'(K) = H, a simple closed 
curve J about p with interior Z and an infinite sequence M, , M2, --- of quasi- 
components of E-H converging to a continuum M in J + E containing p and 
such that M,, M2, --- lie in distinct components N,, N2,--- of H-(E + J) 


1 For details of how to obtain these sets the reader is referred to papers by R. L. Moore 
in the Bulletin of the American Mathematical Society, vol. 25 (1919), pp. 174-176, and 
vol. 29 (1923), pp. 289-302. 














INTERIOR TRANSFORMATIONS ON COMPACT SETS 381 


and such that for each i, k > 0, M; separates M;_, and M;,, in J + E. Since 
by hypothesis 7” 'T(p) is totally disconnected, we may suppose J chosen so that 
J-T'T(p) = 0. Let By = T(A-J) and let R be the component of K — K-B,y 
containing T(p). [Note that T(p) cannot belong to By, since J-T"'T(p) = 0.] 

Now by the local connectedness of K, R is open in K. Hence for almost all 7, 
T(M,)-R # 0, and thus we may suppose this holds for all 7. Since T(H) = K 
is interior [by (1.2)], it follows by (1.4) that each quasi-component of 7” '(R) 
maps onto all of R under 7. Since T”'(R)-J = 0, it follows at once that there 
exists a sequence L, , Lz, --- of distinct quasi-components of 7”'(R) converging 
to a continuum L where p e L C M and for each i, L; C M;. 

Since A is locally connected at p but H is not locally connected at p, it follows 
that there exists a component Q of B — (K + Bo) which has a limit point q 
in R. Since T(L;) = R for each 1, it follows that for each i there exists a point 
qa:¢€T '(q):L;. Now since, by (1.41), 77"(Q) consists of a finite number of 
components of A — H — T~'(Bo) and each q; is a limit point of 7~*(Q), it follows 
that there exists some component W of T~'(Q) such that at least three, say 
Gi; » Vis » Vis (ta < t2 < i) of the points q; are limit points of W. Since T”'(Bo) = 
A-J and q; C E, we have W C E, and since H > M; for each 7, we have W-M; 
= Oforeachi. But now M,,separates M;, and M;, in J + E, and since W C E, 
W > ai, + @:,, this gives W-M;, # 0, which is impossible. Thus the sup- 
position that our theorem is false leads to a contradiction. 

(5.11) Corotiary. Under the conditions of (5.1) there are only a finite 
number of components C, , C2, --- , Cn of T'(K) and, for each i, T(C;) = K is 
interior. 

It may be shown by simple examples that (5.1) does not remain valid if we 
remove either the condition that A lie in a plane or that T be a light trans- 
formation. 


UNIVERSITY OF VIRGINIA. 











BETTI NUMBERS OF 3-FOLD SYMMETRIC PRODUCTS; A 
CORRECTION 


By M. RicHARDSON 


In a previous paper! the writer gave a general method for computing the 
Betti numbers of a complex k obtained by identifying the points of a given 
complex which are congruent under a finite group G of transformations subject 
to certain combinatorial conditions. The Betti numbers of k appeared as the 
ranks* of certain matrices (27;). As one of the applications of this general 
method, the Betti numbers of the 3-fold symmetric product ks, of a given 
complex K, were computed and the results published without proof.’ In 
calculating the rank of the matrix (27) for this case, an error was made, which 
invalidates the formulas of Theorem 5. The correct formulas are given here. 

TuHeoreM. [fk is the 3-fold symmetric product of K, then 


(2 [Rn( Kon) “bh 2R, 4 3  ® (—1)'R;R,_2i] ’ m= 3s, 


R,,.(k) = J 
$ [Rn( Keon) +3 ae (- 1)" Ri: Rn] ; m # 3s, 


where Ra = R,(K), and K;, = K X K X K. 

Remark. If in the mechanical application of these formulas, a term happens 
to contain R.(K), where a > n, or a < 0, this term is to be ignored. 

We note also the following correction to the cited paper. Theorem 1 as stated 
is valid for uw = 1. For u > 1, a slightly modified argument proves that 
R,,(k, x“) = the rank mod + of (x;;) with elements reduced mod z, (i = 1, --- , 8; 
j =1,---,7), p not divisible by x. This change does not affect later develop- 
ments. 


INSTITUTE FOR ADVANCED Stupy. 


Received October 14, 1936. The existence of an error in the formulas was called to 
the author’s attention by R. J. Walker. 8S. H. Kimball had previously and independently 
noted the error and obtained the correct formulas. The author had the advantage of 
seeing his results while preparing this note. 

1 On the homology characters of symmetric products, this Journal, vol. 1 (1935), pp. 50-69 
For definitions, etc., see that paper. 

2 Loc. cit., Theorem 1, pp. 52-53. 

3 Loc. cit., Theorem 5, p. 61. 























THE EXPANSION THEORY OF ORDINARY DIFFERENTIAL SYSTEMS 
OF THE FIRST ORDER 


By Rupo.iren E. LANGER 


1. Introduction. It is a sufficiently curious fact that in the extensive litera- 
ture of the expansion of arbitrary functions in series of characteristic solutions 
of ordinary linear differential systems almost no works dealing with the case 
of the differential systems of the first order are to be found.’ This is the more 
remarkable since in this case the integration of the differential system is possible, 
and much of the analysis which invariably encumbers the discussions of systems 
of higher order is thereby obviated. To be sure, the Fourier’s series, which 
stands as a prototype in this field, is usually associated with a differential system 
of the second order, and so the systems of higher order would naturally have 
suggested themselves as generalizations. Even for purposes of generalization, 
however, the system of the first order also merits attention, for because of 
the relative simplicity of its analysis a material generalization becomes possible 
in the way of a relaxation of restrictive hypotheses. This is found to be far 
from trivial. The theory of its expansions, as it is to be found in the literature, 
involves in fact a number of striking peculiarities, in which it contrasts sharply 
even with the expansion theories of the most closely analogous differential 
systems of the second order. Thus, by way of instance, one and the same 
expansion may be generated by an infinity of essentially distinct (i.e., non- 
equivalent) functions; and again the expansion of a given function, though it 
may converge, only rarely converges to the function immediately concerned. 

It is the purpose of the present paper to present here a new expansion theory 
for the differential systems of the first order, one which differs from that in the 
literature and is believed to have advantages over it. It will be found to 
permit, on the one hand, of a material further relaxation of the restrictions 
upon the system, and to lead, on the other hand, to results which, on the whole, 
are much more nearly in consonance with those which obtain in the existing 
theories for systems of the second or higher orders. In particular, it will be 
found that the formal association of a function with an expansion is in a suitable 
sense unique, and that under quite customary conditions an expansion converges 
to the function with which it is formally associated. Even so, to be sure, some 
distinctive peculiarities persist. These will generally be recognizable, however, 
as inherent in the nature of the case. 


Received April 30, 1937. 
! A notable exception is M. H. Stone, An unusual type of expansion problem, Transactions 
of the American Mathematical Society, vol. 26 (1924), p. 335. 
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2. The differential system. The differential system to be considered, the 
general real differential system of the first order, is of the form 


(1) y'(s) — [pg(s) + r(s)]y(s) = 0, 
ay(a) = By(b). 


The parameter, denoted by p, is to be free to range over all complex values. 
On the other hand, the variable s, the functions ¢(s) and r(s), and the constants 
a and 8 are to be real. Beyond this q(s) and r(s) are to be single-valued and 
summable in the sense of Lebesgue over the interval (a, 6), and q(s) is to fulfill 
a hypothesis of which the explicit statement is deferred to §3 below. 

As applied to the differential equation of the system (1) the term “solution” 
will be understood to designate a function which is an indefinite Lebesgue 
integral and which satisfies the differential equation almost everywhere on the 
interval (a, b). In this sense the equation is solved by the formula 


i [oq(s) + r(s)] ds 
ce?" 


(2) y(s) = 


’ 


with ¢ an arbitrary constant. Inasmuch as any two solutions must be linearly 
dependent, the differential equation determining their Wronskian to be zero, it 
follows that the formula (2) includes all solutions. 

A solution of the differential equation which is not identically zero but which 
satisfies the boundary condition is to be called a characteristic solution of the 
differential system. The substitution of the form (2) into the boundary condi- 
tion yields for the existence of such a characteristic solution the condition 


b b 
of qlaids +f r(s) ds 
a= Be a a ? 


In general this condition restricts p to an isolated set of “characteristic values”. 

The exceptional cases are: first, that in which both a and 8 are zero, the condi- 

tion then being obviously vacuous; second, that in which one but not both of 

the constants a, 8 is zero, the condition then being clearly impossible; and, 
b 

third, that in which [ q(s) ds is zero, the condition then being independent of p. 


a 


To exclude these exceptional cases it will be assumed that 
b 
(3) aa | q(s)ds # 0. 


When this hypothesis is fulfilled the differential system may be normalized as 
follows. 
The relation 


b b 
Bl vf alerdet Freda 
{pb je 


bol ems 
determines the real constant v. With it the change of variable 


: [vq(s) +r(s)] ds 


y(s) = w(s)e** ' 
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transforms the differential system (1) into the form 


w’(s) — [p — v]q(s)w(s) = 0, 
(a) = F wid) 
wa) = ry w(b). 


The substitutions 


a=~¢ 
r= 
b—a’ 


“) 
h = (p — v) | q(s) ds, 


p(x) = ee 
i q(s) ds 


u(x) = w(s) 
complete the normalization, reducing the system (1) to the form 
(4.1) u’(x) — Ap(x)u(x) = 0, 
u(O) = u(1), 
if a8 > 0, and to the form 
(4.2) u'(r) — Ap(x)u(x) = 0, 
u(0) = —u(1), 
if a8 < 0. The function p(x) is summable over the interval (0, 1) and 


1 
(5) [ p(xa)dx = 1, 


3. The hypothesis. If there exists on (0, 1) some sub-interval, say 6, upon 
which the function p(z) is essentially of one sign,’ then on this sub-interval the 
function 


z 
(6) P(x) = [ p(x) dz, 
0 
is monotone and, of course, continuous. The transformation 


(7) t = P(z) 


accordingly maps the interval 6 in a unique and continuous manner upon a 
corresponding interval, say w, of the t-axis. If 5is taken to be directed positively, 


? The term “‘essentially’’ will be used throughout the discussion in the sense of ‘‘almost 
everywhere’’. 
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w will also be directed, positively or negatively according as on the interval 6 
the function p(x) is essentially positive or essentially negative. Finally, on w 
the transformation (7) has a unique inverse 


(8) x = P(t), 


in which P~'(t) is the Lebesgue integral of a summable function which essen- 
tially maintains its sign.” 
The hypothesis to which the given differential system when normalized, i.e., 
in the form (4.1) or (4.2), is to be subjected is the following: 
There shall exist on the fundamental interval (0, 1) at least one open point-set, 
to be designated by A, upon which the coefficient function p(x) fulfills the requirements : 
(i) that it be essentially of one sign upon each of the intervals comprising the set A; 
(ii) that for almost every value of t on the range 0 < t < 1 the congruence 


P(x) =t (mod 1), 
be fulfilled by some x on A, but that for each value of t it be fulfilled by at most 


one x on A. 

The point-set A, being open, consists of enumerably many non-overlapping 
open intervals. These are to be designated by 6;. The hypothesis (i) insures 
that under the transformation (7) there corresponds to each interval 6; an 
interval w; on the t-axis. The hypothesis (ii) thereupon insures that these 
intervals w; are non-overlapping, and that the set of them as a whole would be 
obtainable by making a suitable sub-division of the interval 0 < ¢ < 1, and 
then at most relocating some or all of the individual sub-intervals by translating 
them through appropriate integral multiples of the unit distance. To avoid 
meaningless details it will be supposed that the designation of intervals 4; is 
so made that the function p(x) is essentially of opposite signs upon any two 
such intervals which abut, or, stated differently, that any number of abutting 
component intervals of the set A upon which the function p(x) is essentially 
of the same sign will be regarded as comprising together one and the same 
interval 6;. The intervals w; will be collectively designated as the point-set Q. 
Since the transformation (8) exists on each individual interval w; , it evidently 
exists over the entire set 2, and the point-sets 2 and A are the transforms of 
each other under the relation (7) or its inverse (8). 

The simplest type of differential system fulfilling the hypothesis stated is that 
in which p(x) is essentially positive throughout.‘ The point-set A consists in 
this case simply of the interval 0 < x < 1. Because of its simplicity this type 
of system fails to display the peculiarities which are distinctive of the more 
general cases. A case which is still simple for the present discussion, though 
not amenable to earlier ones, is that in which the point-set A does not com- 
pletely cover the interval (0, 1), the function p(x) being essentially of one 
sign on A, but not essentially of either sign on any interval which is contained 


3 Cf. M. H. Stone, loc. cit., p. 348. 
‘This was completely treated by M. H. Stone, loc. cit. 
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in the complement of A. In this case as in the preceding one the point-set A is 
unique. In the general case, however, that is not so, the hypothesis on A 
being in fact generally satisfied by infinitely many different point-sets. This is 
always so when the total variation of the function P(x) over those sub-intervals 
of (0, 1) on which it is monotone exceeds the unit value. 

The distinguishing features of the present hypotheses as compared with those 
of the earlier theory applying to the same differential systems may be sum- 
marized briefly as follows: 

(i) the number of intervals constituting the set A is not restricted to be finite 
but may be enumerably infinite; 

(ii) a set of intervals A need not completely cover the fundamental interval; 

(iii) the function p(x) is subject to no hypothesis other than that of sum- 
mability over the point-set complementary to A; 

(iv) the function p(x) is not restricted to be bounded. 


4. The relation of “orthogonality”. Since the differential systenis (4.1) and 
(4.2) are both explicitly integrable, their characteristic values and solutions 
are directly obtainable. These are found to be for the system (4.1) 


o = 0, u(x) = 1, 
(9.1) ) Ay P (2) 
An = 2nzi, Uin(t) = on™ n=1,2,3,---, 


and for the system (4.2) 


An = (2n — 1)zxi, 


(9.2) 
— n= 1,2,3,-:-. 


Usn(x) 


In either case the set of solutions as a whole has the property of satisfying 
the relations 


0, if h# —k, 


1 
[ P(x)un(x)ux(x)da = % if h = —k. 


This property of weighted normality and orthogonality with respect to the 
interval determined by the points at which the boundary condition applies is, 
of course, wholly analogous to those which obtain for the characteristic solu- 
tions of differential systems generally, and which have been made fundamental 
to many deductions of the associated expansion theories. Precisely this prop- 
erty, however, will find no application in the present discussion, and it is 
principally in this point that the present theory for systems of the first order 
departs from that which is to be found in the literature. 

The point-set 2 will easily be recognized to be of such a form that a function 
defined upon it may still consistently be specified to be periodic or quasi-periodic 
with respect to the unit distance. Such a specification moreover then extends 
the definition of the function to almost all values of t. It will be convenient to 
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reserve the notations F,(t) and F_,(t) to designate respectively functions whose 
definitions on 2 are extended by the relations 


(a) F(t + 1) = Fit) 
(b) F_4(t +1) = —F4(t). 


In the case of a function of the type F,(t) the following will readily be verified, 
namely, that if the function is summable, then 


(11) a 


the symbol | w; | designating the interval w; redirected if necessary so that its 
sense is positive. At the same time it is known’ that in virtue of the trans- 
formation (7) 


(10) 


1 
F(t) dt =: [ F(t) dt, 
0 


|;| 


[ Fi(t)dt = | p(x) F\(P(2)) dz, j = 1,2,3,---, 


1 
and that conversely, if p(x)f(x) is summable over A, then under the trans- 
formation (8) 


I p(x) f(a)dx = [ f(P™'() dt, j=1,2,3,---. 
6; @j 


Since in the case of each j the replacement of w; by | w; | may be compensated 
for by the replacement of p(x) by | p(x) | on 4; , it will be seen that the relations 
above when summed with respect to 7 lead in virtue of the formula (11) to the 
result 


, = 
(12) [ F\(t)dt = | | p(x) | f(x) da, 
0 4 
the existence of either integral implying that of the other, and the functions 
involved being related thus: 
f(x) = F\(P(2)) on A, 
F(t) = f(P™(d) on Q, and F(t + 1) = Fit). 


Now whether the differential system in question be (4.1) or (4.2), it is seen 
from the appropriate formulas (9.1) or (9.2) that for every choice of the sub- 
scripts h and k, and as a function of t, the product {w,(x)uy(r)}--p-1¢@) admits 
the unit as a period, and that 


» _ {fh -k, 
oe ‘a, if h = —k. 


It follows, therefore, from (12) that 


(13) 


~ 


e . (0, if h # —k, 
(14) i | pla) | ug(a)u,(x) dx = \1, ft a 


It is upon this new relation of orthogonality that the present theory will be based. 


5 Cf. M. H. Stone, loc. cit., or E. W. Hobson, The Theory of Functions of a Real Variable, 
second edition, 1921, pp. 592-595. 
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5. The expansion of an “arbitrary” function. The expansions of a given 
function f(z) in series of characteristic solutions differ somewhat in their details 
for the systems (4.1) and (4.2). Let the attention be focused first, therefore, 
upon the system (4.1). If f(x) is a function given arbitrarily except for the 
restriction that p(x)f(z) be summable over the point-set A, it may be regarded 
as formally associated with a series of characteristic solutions in the manner 


(15) f(x) ~ aow + DX fa,u(x) + a, u_,(x)}. 


n=l 
The operations of multiplying this by | p(x) | u_.(z) and integrating term by term 


over A lead formally, and because of the relations (14), to a familiar “evaluation”’ 
of coefficients, i.e., 


(16) an = | | p(x) | f(a)u_r(x) dx. 
A 


With these coefficients, and with the explicit forms of the solutions as given 
by the formulas (9.1), the relation (15) takes the form 


fiz) ~ 4Ay + DY [A, cos 2neP(x) + B, sin 2nwP(x)}, 


n=l 


~ 
_— 
~I 
_ 
Il 


2[ | p(x) | f(x) cos 2nxP(x) dz, 
4 . 


B, 2 | | p(x) | f(x) sin 2nwP(z) dx. 

Let F,(t) now designate the periodic function associated with f(z) by the 
relation (13). The transformation (8) is found then, in virtue of the formula 
(12) to convert the expansion (17) into the Fourier’s series for F(t). The 
heuristic motivation for this deduction may now, of course, be abandoned. 
Whenever f(z) is a function such that p(x)f(xz) is summable over A the formulas 
(16) associate with it a set of coefficients and thereby an expansion (15). The 
result deduced above may be formulated then as follows: 

TueoreM. If p(x)f(x) is summable over the point-set A, there is associated 
with f(x) an expansion (15), (16) in terms of the characteristic solutions of the 
differential system (4.1). This expansion is the transform under the relation 
t = P(x) of the Fourier’s series of the function F,(t) which is related to f(x) in the 
manner (13). 

The system (4.2) may be similarly considered. If the expansion associated 
with f(z) is indicated thus, 


(18) f(z) ~ ya {a,u,(r) + a_,u_,(x)}, 
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the coefficients are found, in the manner above, to be again evaluated by the 
formulas (16). With these coefficients and the explicit forms (9.2) of the 
characteristic solutions the expansion takes the form 


f(z) ~ X {A,, cos (2n — 1)eP(x) + B, sin (Qn — 1)eP(2x)}, 
n=l 


(19) A, 


2 | | p(x) | f(x) cos (2n — 1)eP(x) dz, 
a 


B, 


2 | | p(x) | f(x) sin (2n — 1)rP(x) de. 
A 


Let F_,(t) designate now the quasi-periodic function to which f(z) corresponds 
under the transformation (8), i.e., 


(20) F_(t) = f(P"(b) on Q, F_4(t+1) = —F_(t). 


Then under this transformation the expansion (19) is found to become the 
Fourier’s series for the function F_,(#). 

Tureorem. I[f p(x)f(x) is summable over the point-set A, there is associated with 
I(x) an expansion (18), (16) in terms of the characteristic solutions of the differential 
system (4.2). This expansion is the transform under the relation t = P(x) of 
the Fourier’s series of the function F_,(t) which is related to f(x) in the manner (20). 


6. Conclusions. Inasmuch as an arbitrary function and its expansion in 
characteristic solutions transform together into a Fourier’s series and its gen- 
erating function, it follows that these expansions admit of a theory which is 
coextensive with that of the Fourier’s series. Moreover, the facts of this theory 
relative to convergence, summability, etc., either at points or over intervals, 
are evidently deducible simply by appropriate considerations of the manner 
in which the transformation (7), (8) affects the relative properties of the 
Fourier’s series. In the present paper these considerations will be entered into 
only to the extent which will suffice to emphasize the differences between the 
present theory and the theory as given heretofore. The theorems chosen and 
given below differ sharply in these two theories, and in the present one are 
much more nearly in consonance with their analogues in the existing theories 
for differential systems of the second or higher orders. 

Consider any function f(z) and its expansion relative to a point-set A. Under 
the relation (7) they are transformed into the associated function F,(t) (or 
F_,(t), as the case may be) and its Fourier’s series. Now if x is any point of 
the set A, then since this set is open it lies upon ‘some sub-interval which is a 
neighborhood of it and which is wholly included in one of the intervals 6; . 
Under the relation (7) this point x and this neighborhood correspond to a 
point ¢ and a neighborhood of it. As is familiar, however, the behavior of the 
Fourier’s series of F\(t) (or F_,(t)) at the point t is completely determined by 
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the values of this function in such a neighborhood. Under the return trans- 
formation (8) this leads to the 

Tueorem. If p(x)f(x) is summable over the point-set A, the behavior of the 
expansion of f(x) in terms of the characteristic solutions of either of the differential 
systems (4.1) or (4.2) is determined at any point of A by the values of f(x) in an 
arbitrarily small neighborhood of that point. 

Since the function P(x) is monotone on each of the intervals 4; , it follows that 
every point of A is possessed of some neighborhood upon which the transforma- 
tion (7) is monotone. If upon such a neighborhood the function f(z) is also 
monotone, or, more generally, of bounded variation, the same will, therefore, 
be true of the associated function F:(¢) upon some interval which includes the 
corresponding point ¢t. Under this condition, however, the Fourier’s series is 
known to converge at the point ¢ to the value $[Fis(t+) + Fas(t—)], a fact 
which leads to the 

TueorEeM. If p(x)f(x) is summable over the point-set A, then at any point x 
of A in some neighborhood of which the function f(x) is of bounded variation, the 
expansion of f(x) in terms of the characteristic solutions of either of the differential 
systems (4.1) or (4.2) converges to the value 


3[f(zt+) + f(x—)]. 


Consider now the case of a point x, which does not belong to the point-set A, 
but which is a boundary point of it. Under the transformation (7) it corre- 
sponds to a point & which is a boundary point of 2. The behavior of the 
Fourier’s series of either of the functions F,(t) or F_,(é) at the point t is com- 
pletely determined locally, i.e., by the values of the function in any ordinary 
neighborhood of the point. In virtue of-their definitions (13) and (20), how- 
ever, these functions are themselves determined in any ordinary neighborhood 
of t by their values on Q, and this is immediately recognized to mean by their 
values on “a neighborhood of & in 2”, a term which by definition is to signify 
that set of intervals w;, and parts of such, which are congruent (mod 1) to the 
parts of an ordinary neighborhood of t,. Under the return transformation (8) 
a neighborhood of ¢ in 2 corresponds to ‘“‘a neighborhood of 2, in A’’, a term 
which is to be analogously defined to designate that set of intervals 6; , and 
parts of such, upon which the function P(x) takes on values which are congruent 
(mod 1) to values in a neighborhood of P(x). This leads to the 

THeorem. If f(x) is any function such that p(zx)f(x) is summable over the 
point-set A, and if x, is any boundary point of A, then the behavior of the expansion 
of f(x) in terms of the characteristic solutions of either of the differential systems 
(4.1) or (4.2) at x is determined by the values of f(x) in an arbitrarily small 
neighborhood of x», in A. 

The simplest case of a boundary point is that of a point, say x, , which is an 
end point of an interval 6; , and which has another such end point, say 2; of 
an interval 6, , corresponding to it under the congruence P(x;) = P(x.) (mod 1). 
In this case the neighborhoods of x, in A, and of x; in A coincide, and if they 
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are sufficiently small they consist simply of the pair of intervals 6;, 6; or parts 
of them which abut z; and z; respectively. It is seen at once that if f(z) is of 
bounded variation in such a neighborhood of zx; in A, then F4:(é) is of bounded 
variation in the neighborhood of t;. The Fourier’s series accordingly con- 
verges to 4[Fai(ti+) + Fas(ti—)], i-e., in the case of F(t), to 

[lim F(t) + lim Fi(t)|rone, 


tt; tty 


and in the case of F_,(t) to 
[lim F_,(t) + (—1)‘ lim F_i(é))rone, 


t—>t; tt) 


where k is the integer which is determined by the relation t; = t; + k. 

TueoreM. Let f(x) be any function such that p(x)f(x) is summable over the 
point-set A, and let x; and x; be any pair of end points of intervals of A which are 
related so that P(x;) = P(x:) + k, where k is an integer. Then if f(x) is of 
bounded variation in an arbitrarily small neighborhood of x; in A, its expansion in 
terms of the characteristic solutions of the differential system (4.1) converges at x; 
to the value 

3[lim f(x) + lim f(x)Jeona, 


zr! 
. z—2) 


and its expansion in terms of the characteristic solutions of the differential system 
(4.2) converges at x, to the value 


[lim f(z) + (—1)‘ lim f(x)],ona- 


zr! 
mr 


If A contains intervals which abut the points z = 0 and z = 1, these points 
stand in the relation of x; and 2; to each other, with the integer k as 1 or —1 
in virtue of the relation (5). Hence we have the following 

Corotiary. If p(x)f(x) is summable over A, if x = 0 and x = 1 are end 
points of intervals of A, and if f(x) is of bounded variation in some neighborhoods 
of these points, then the expansion of f(x) in terms of the characteristic solutions of 
the differential system (4.1) converges at x = 0 and at x = 1 to the value 


2(f(O+) + f—)I, 


while the expansion of f(x) in terms of the characteristic solutions of the differential 
system (4.2) converges at x = 0 to the value 


21f(0+) — fl—)], 


and converges at x = 1 to the value 


2[f(l—) — f0+)]. 


The function F,(t) (or F_,()) will evidently be null, i.e., will vanish almost 
everywhere, if and only if the function f(z) is null on the point-set A. The 
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condition that the difference of two functions be null, which is necessary and 
sufficient for the identity of all their Fourier’s coefficients, leads, therefore, 
to the 

TueoreM. If fi(x) and fo(x) are any two functions such that p(x)fi(x) and 
p(x)fe(x) are summable over A, a necessary and sufficient condition that their 
expansions tn terms of the characteristic solutions of either of the differential systems 
(4.1) or (4.2) be identical is that they coincide almost everywhere on A. 

Since the expansion coefficients as given by the formulas (16) involve the 
values of f(x) only over A, it is evident that every expansion is strictly relative 
to a point-set A. For any specifically given function f(x) there will, therefore, 
be as many expansions in terms of characteristic solutions of either of the 
differential systems (4.1) or (4.2) as there are point-sets A which fulfill the 
hypotheses stated in §3, and over which p(xz)f(x) is summable. Any two such 
point-sets A may, of course, have points in common, and at such points the 
expansions of a given function relative to the two point-sets in question will be 
seen at once to behave identically. For since a point common to two sets A 
is an inner point of each, it possesses a neighborhood which is also contained 
in each of the sets. This point and neighborhood correspond to a point on the 
t-axis and a neighborhood of it which is contained in each of the corresponding 
point-sets 2. The functional values in this neighborhood suffice, however, for 
the determination of the behavior of the Fourier’s series at the point in question, 
and the result of this determination is, therefore, independent of which point- 
set Q is held in mind. 

The case of a unique point-set 4 was found in §3 to be exceptional. It is 
clear that the case of a unique expansion for a function f(x) such that p(x)f(x) 
is summable over the interval (0, 1) is exceptional in precisely the same sense. 
The case of such a unique expansion relative to the interval (0, 1) occurs only 
in connection with the simplest systems, namely, those in which p(x) is almost 
everywhere positive. In general, there is a multiplicity of expansions for each 
given function, and this feature must be regarded as one in which the expansion 
theory for differential systems of the first order as here given is at variance 
with the existing analogous theories for differential systems of higher orders. 
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TRANSFORMATION OF DIFFERENTIAL EQUATIONS IN THE 
NEIGHBORHOOD OF SINGULAR POINTS 


By C. I. LUBIN 


1. Introduction. The singular points of the system of differential equations 


(1) — = X(z,y), oY Y(z, y), 

where the functions X(z, y) and Y(z, y) both vanish at a point, have been 
considered by many mathematicians. Beginning with Briot and Bouquet 
the list includes Poincaré, Picard, Bendixson, and continues with Dulac, Malm- 
quist, Perron, Birkhoff, and many others. A full bibliography, particularly for 
complex differential equations, can be found in Mémorial des Sciences Mathéma- 
tiques, Fascicule LXI, Points singuliers des équations différentielles, by H. M. 
Dulac. 

The discussion here assumes the functions X and Y to be real and analytic 
in z and y, in the neighborhood of the singular point. The variables z, y, ¢ 
take on only real values, and only real transformations are introduced. Further- 
more, it is assumed that the singular point considered is at the origin and that 
the first degree terms in X(z, y) and Y(z, y) are such that the differential equa- 
tion can be transformed by a linear transformation to the form' 


! - ‘- 
7 =-yt+ a ajyty = X(z, y); 
i,j7=0 
(1.1) 2 
2 =r+ >> b,x’ y’ = Y(z, y) (i +j2 2). 
= 


At such a point the solutions are closed curves or spirals and the point is called 
a center or a focal point, respectively. 

It is proposed here to find a canonical form for the system (1.1) and to discuss 
the properties of the transformation attaining that form. 


2. Failure of the usual method. It appears from the discussion below that 
it is not always possible to set up formal power series 


u=xr+ DYejr'y’ = f(z, y), 


(2) : 
v=yt Ddijz'y’ = g(z, y) 


Received July 15, 1936. 
! Poincaré, Jour. de Math., (4), vol. 1, p. 172; Picard, Traité d’ Analyse, 1928, Chap. IX 
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which transform formally the system (1.1) into 


du _ dv 
x_ rer 


(2.1) di 


Consequently this form cannot serve as the canonical form for the system of 
differential equations (1.1). 

If there are formal power series (2) leading to the system (2.1), the expressions 
f(z, y) and g(z, y) of (2) would formally satisfy the partial differential equations 


. 0 (7) 
X(z, y) &f + Y(z, y) ff =- 9, 

Ox oy 
(2.2) 

‘ 0g , ag 
X(x, y) = + ¥(z, y) ay 7S 
Introduce the new variables (see Picard and Poincaré, loc. cit.) 

(2.21) x = pcos 8, y = psin 0. 
The partial differential equations (2.2) become 


af Q\) af Rk_ 
(t+ 5) +2 E - —s 
(2.3) 


where © and R are given by 


2 = > p'2,(0) = p(—sin 0X + cos 6Y), 
(2.31) a 
R= } > p' R(0) = p(cos 0X + sin eY), 


and 


X = X(p cos @, psin 0) + psin@ = >> p' X,(8), 
t=2 


Y(p cos 0, p sin 0) — pcos@ = > p' Y;(6). 


t=2 


Yy 


In the above expressions 2; and R;, and X; and Y; stand for homogeneous 
polynomials of degree i in sin @ and cos @, and consequently can be written as 
linear expressions in cos 78, sin 78, etc., i.e., in the form 


a; cos 10 + a;_2 cos (¢ — 2)0 + .-- + b; sin 76 


(2.32) 
+ bj» sin (¢ — 2)0+.---. 
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The formal expansions (2) become in terms of p, sin 6, and cos @ 
(2.33) f = pecos@+ > p f:(0), g = psin@d + > 3 p'9i(9), 
i= S 


where the f; and g; are trigonometric expressions of the type just described, 
(2.32). The existence of the formal expansions (2) in terms of x and y implies 
the existence of the formal expansions (2.33) in terms of p, cos @, and sin 6, and 
conversely, the existence of the formal expansions (2.33) implies the existence 
of those of (2). 

To obtain the f; and g; , substitute the formal series (2.33) in (2.3) and collect 
terms in p". We get 


tf 


(2.4) = —-G9nt+ lh ’ ——— Sn + Mn, 


do 





fn 
dé 


where l,, and m, are trigonometric sums of the above type (2.32). Furthermore, 
the J, and m,, involve f; and g; only for 7 less than n. 

Now suppose fi , 91, --- ,fn—1, Gn. have all been computed and are of the 
type (2.32). To compute f, and g, , set up from (2.4) the differential equation 


If, , bec 
(2.41) <s = a Oe cos j@ + B; sin 7@ (j =n,n —2,---), 
where the A}, B? arise from m, and l, and thus involve f; and g; only for 7 less 


than ». 
The solution of the differential equation (2.41) is 


(2.42) fn = a, co8 6 + 8, sin@ + >> Aj cae a my? 





where a, and 8, are arbitrary and where j = n,n — 2,--- ,0,orj = n,n — 2, 
--+ , 1, depending on whether n is even or odd. 

When j = 1, a case which can only occur when n is odd, and when either 
A} # 0 or By = 0, the solution (2.42) fails. Consequently, in general, the 
system of partial differential equations (2.3) does not have a formal solution 
in terms of p, sin 6, cos 6, of required type. By an expression of required type is 
meant a sum of the form (2.33), i.e., Zp‘f:(@), where the f;(0) are expressible in 
the form (2.32). Thus the partial differential equations (2.2) do not always have 
a formal power series solution. 

Because of this we modify the partial differential equations and consequently 
obtain a different normal form (v. §4 et seq.). 

Let us note here the form taken by f, , g, of the solution, when it exists. Sup- 
pose for all n, say up to and including n’, the solution (2.42) has not failed. 
Consider the solution for some odd number n less than n’. The expression 
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on the right in (2.41) then contains no term in sin @ or cos @, i.e., both Af and 
BP equal zero. If in (2.4) 


l, = a,cos@+ 6, sin@+.-.., 
we have 
m, = —a,sin@+b,cos@+.---, 


where the terms in cos 7@ and sin 76 for 7 > 1 have not been written. 
If f, has the form (2.42), g, becomes 


tf 


If , , F 
(2.43) g, = f- +1, = a, sin @ — B, cos @ + a, cos 6 + by, sin @ + ---, 


where again only the terms in sin 6, cos @ have been written. 


3. A preliminary transformation. In order to simplify subsequent discussion 
a real analytic transformation will be introduced in the theorem below. 

Suppose that the expressions f; , gi (i = 1, 2, --- , 2k) in f and g (2.33) when 
computed by taking all the arbitrary constants a;, 8; zero are of required type 
but that the expressions for foxs: , gers: fail to be. That is, let us suppose the 
differential equation (2.41), for n = 2k + 1, has terms in sin @ or cos @, i.e., 
Af or BY ¥ 0. 

By a method of Dulac,” we have the following 

TureoreM. By a real analytic transformation 


w= 2 + fe (x, y) + --- + foxes (x, y) 
2 = yt go (ry) +--+ + gees (2, y) 


I(x, y), 
g(x, y), 


(3) 


where f(x, y), gi(x, y) are polynomials of degree tin x and y, obtained by substituting 
x = pcos 0, y = psin@ in p'f:(0), p'gi(0), (i = 2,3, ---, 2k), of (2.42) and (2.43) 
and in po fon..1(8), po gon s3(8) of (3.22) below, the differential equations (1.1) are 
reduced to 


a = —2 + Aw(w’ + 2°) — Bow’ + 2)* +... = Ww, 2), 
(3.1) 
a = w+ Bul’ + 2°) + Adciw’? + 2) + --- = Z(w, 2), 


where for the real analytic functions W(w, z) and Z(w, z) terms of higher degree 
than (2k + 1) are not written and where A and B are constants, not both zero, 
uniquely determined, in (3.41) below. 


? Dulac, Bull. Sci. Math., vol. 51 (1923), p. 74. 
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Let the differential equations causing the difficulty in the solution of the 
partial differential equations (2.3) be written 








Ufo, , 
Yes = —guy + acosé + bsin 6 + les, 
(3.2) 
; oo = farsi + ccos9+dsiné + mx, 
where lex1 , Mex41 Contain sin 76, cos 76, only for 7 = 3, 5, --- , 2k + 1, and where 
J J y J 


either a + d # 0 orb — c ¥ O or both subsist. 

Introduce the trigonometric functions f*(@) and g*(6), of the required type 
(2.32), which satisfy the differential equations 
' d da* 
(3.21) ~ = —g* + las, a = f* + muy, 
where the arbitrary constants of the solution are chosen as zero. Since bx. 
and m4; contain no terms in sin @ or cos 6, there are no such terms in f* or g*. 
Now write 





(3.22) Sorsi = f* + @ COs 8, Gait = g* + B cos 8, 
where @ and @ are taken as 
a = —}(b+ 0), B = 3(a — d). 
If we introduce the new variables by the substitution (3), we get as in (2.2) 
pa dw af .- af ,. dz og, , 0,, 
3.3 —=—X+4—)} —~=—X +—Y. 
34) dt ax dy ’ dt ax si ay 


In terms of p and @, the expressions on the right side become 


: F(,, 2), *) ( 5) #() 
wae $(1+5)+2( , a6 '+3 T ap\p/’ 


where 2 and FR have the significance of (2.31). These quantities when expressed 
in terms of p and @ are, up to terms of degree 2k + 1 in p, equal to —f and g 
respectively. The terms in p“*' in each expression of (3.31) are, respectively, 





ee — acosé — b sin ® — lys:, fats —ccos@ —dsin#é — mxy:, 
which, on use of (3.22), become 
(3.4) —§ori1 + A cos 6 — B sin 6, forsa + Bos 6 + A sin 8, 


where 


(3.41) A= -—}(a+d), B = 3(b — cc). 
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In terms of xz and y the expressions (3.3) become 


FyiAy_ 


’ s 2 2\k sts: 
= ay g + (Ax — By) (xe + y) + , 


(3.5) 
a9 : waa 
9x 4 Oy 5 4 (Br + Ay) (2? +?) + ---. 
Ox oy 


The expressions in (3) can be solved for z and y in terms of z and w and substi- 
tuted in (3.5), whence we get the differential equations (3.1). 


4. Second preliminary transformation. Let us start with the system of 
differential equations (3.1) and modify the corresponding partial differential 
equations as shown in (4.3) below in order to take care of the terms in 7. 
The system (4.3) so introduced still presents for terms of higher degree in p the 
difficulty found in the last section for the terms in p”*’, so that a further 
transformation is useful. 

It is convenient to consider this under two cases with a separate theorem in 
each case. 

Casel. A # 0. 

THreoreM. By a real analytic transformation 


w+ he(w, z) +--+ + havsa(w, 2) 
v =z + go(w,z) +--+ + Gaesa(w, 2) 


h(w, 2), 


u 


(4.1) 


q(w, 2), 


where hw, z) and qi(w, z) are polynomials of degree i obtained by the substitution 
w = p cos 0, z = p sin 6 in p'h,(6) and p'q:(0) as found below (i = 2, 3, ---, 
4k + 1), the differential equations (3.1) are reduced to 


ai = —v + (Au — Bo) (wu? + v")* + Lu(u’ + v*)* + --- = U(u, »), 
(4.2) 
t = u + (Bu + Av) (uw? + 0°) + Lol’ + 0*)™ + --- = V(u, »), 


where in the real analytic functions U(u, v) and V(u, v) terms of degree higher than 
(4k + 1) have not been written and where L is determined below, (4.563). 

Following the method of the last section, the partial differential equations 
corresponding to (2.2) are 


oh 


aan Vw, z)+ ~ Z(w, z —q + (Ah — Ba) (h’ + @¥*, 


(4.3) 
h + (Bh + Aq) (Wh? + q°)', 


 -_ oq , 
—W (w,z) + =~ Z(w, z 


where W(w, z) and Z(w, z) have the significance of (3.1). 
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Write, as in (2.21), w = pcos 6, z = psin 6, whence (4.3) become 


h h 
m(1 +5 5) +22. —q + (Ah — Bq) (h? + 9), 


06 
(4.31) R 
a Q r) ° nb 
22 (1 + ‘) + ao = h + (Bh + Aq) (h’ + q)*, 
p dp p 
where 
= p[— sin 6 W(p cos 6, p sin 6) + cos 6 Z(p cos 0, p sin @)] — p° 


jie Ba”** + idee 
= plcos 0 W(p cos 6, p sin 0) + sin 6 Z(p cos 8, p sin 0)] = Ap“ *? + . 
Now let us seek a solution of the partial differential equations (4.31) of the 
form 
(4.311) h = pcos@+ > p'h;(0), q = psin@+ = p'q:(8). 
Using this form for the solution we arrive at the system of ordinary differ- 
ential equations similar to (2.4): 


7 dh; dq; 
(4.32) 7% 5 = h, 
for] S71 S 2k + 1, while for 2k +1 <7 
= - <>*e* Bo ee ae ee 
+ 2k(A cos é@ — B sin 6) (cos @ hyo. + sin qi) + m,, 
(4.33) 
“a =h; - es —A(i — Qk)qi-e. + Bhi-oe + Aqi-s 


+ 2k(B cos é@ + A sin @) (cos Ohio, + sin Oqi-x%) +, 


where m; and n; involve h; and q; forj < (¢ — 2k). 

We shall first consider the differential equations for i S 4k and then consider 
the set for 7 = 4k + 1 separately. Let us suppose that for some integer ¢ = 
s(s S 4k), the h; and q; fori S s — 1 are of the required type. Furthermore 
let us suppose that the h;, q; fori = 2,3, ---, (s — 2k — 1) have been determined 
completely but that for i = (s — 2k), ---, (s — 1) there are terms in sin @ and 
cos 6 with arbitrary coefficients which are still undetermined. The solutions 
of these differential equations for even subscripts exist and are of the required type. 
For odd subscripts the solution of the differential equations for i < s by virtue of 
(2.42) and (2.43) is 


h; = M; + a; cos 6 a B; sin 6, 


(4.34) 
gi = Nie t+ (a; — Bi) cos 6 + (bs + a) sin @, 


where fori < s — 2k, a; and @; are determined as indicated below, (4.37), but 


are still left undetermined for s — 2k <i< s — 1. 
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From (4.32) we see that the solutions of the required type (4.34) exist for 
t= s S 2k +1 with M;, N;, a;, and b; all zero. We need then consider 
only the differential equations for those odd numbers s, such that 2k + 1 <s < 
4k + 1. In the expressions (4.34) take i = s — 2k and substitute them in the 
differential equations (4.33) withi = s. We get 





@ = —Gs + 0086 (a4 A(—s + 4k + 1) + 4,) 
(4.35) + sin 6(—2kBay—% + B,-%A(—s + 2k + 1) +6.) + me 
«t 0 
a = h, + cos 0 (8,2, A(s — 2k — 1) + 2kBay_x + c,) 


+ sin (A a,—ox(—s + 4k + 1) +. d,) + n¥, 
where m? and n* have no terms in sin @ or cos @ and involve h; and q; only for 
it S s — 2k, and where also the coefficients a, , b, , c,, d,, do the same. Note 
that these quantities do not depend on a, , B.—2* . 

The differential equations (4.35) lead to 
ah : 
——— = —h, 8 3s—2k <2 Ss des —F, 
(4.36) ae + sin 6 (2a,_2,A(s — 4k — 1) ) 
+ cos 6 (—4kBa,_2;. — 28.2 A(s — 2k — 1) + E,) + M, . 
where E, = b, — c,, F, = a, + d, and M, = —n* + dm* /dé. 


Now choose 


F 


“oe ™ ZAle — &F — 1)’ 
— 1 2k BF 
Bom = 570, — EF — 1) E o ent | 


This can be done for s # 4k + 1. 

With this choice of constants, the differential equation (4.36) has a solution 
of the required type where we now choose the arbitrary coefficients a, , 8, to be 
zero (i.e., when s > 2k). Hence there exist the required functions h;, q;, 
1 = 2k + 3,---,4k — 1, by virtue of the proper choice of the coefficients 
cio , Bio. These were the terms left undetermined in h; , gi, i < 2k + 1, 
so now with this set we have the functions h;, qi, i = 1,2, --- , 2k, 2k + 2,.--, 4k. 
The arbitrary constants a, , 824; are still to be determined. 

To determine hgxs1 , Gari, let us consider the differential equations (4.35) for 
s = 4k + 1; they become 





Sees = —Gaesi + A541 0080 + (—2kBaryy: — ZAKBox 4: 
_ + bya) sin @ + mig, 
see = hava + (2A Bregs + ZkBorsy1 + Ca 41) C088 + densi 8in @ + ning, 


where the terms have the significance given above (4.35). 
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The second order differential equation becomes 


dh — 
(4.41) —— = hays + Euyi cos? — FPysyi sin 0 


—4k( Booey: + ABx+1) cos 6 + Muy. 


We see that no choice of aex41 , Bexe+1 suffices to give (4.41) a solution of the re- 
quired type, unless (@yii1 + daeyi1) = 0, ie., Pays = 0. 
If (gies ae 441) = 0, choose 


(4.42) Baas: + ABoxr +1 = (Baers — Cari) /4k = Buis/4k, 


and the solution hy, of (4.41) will be of the required type, while the constant 
L of the lemma is zero. 

However, F'4.,; = 0 is not the general case, so we proceed under the assump- 
tion Fy. ~ 0. 

Now as in (3) use the functions h; , g;, ¢ = 2,3, --- , 44 just found, to make 
the change of variable (4.1) which in terms of p and @ becomes 


u = R(p, 0) = pcos 0 + pho(O) + --- +p * hyss(8), 
v = G(p, 0) = psin @ + pqo(O) + --- + pe” ques1(8), 


where hyxsi(9), daxsi(), and the coefficients ages, Besa, IN heeia(O), qeesi(O) are 
determined below. 
In terms of p and @ we have, as before: 


(a) mL ah(p, 8) (1 5) + ah(p, 0) R 








(4.51) dt ~—=« p* dp p’ 
: dv _ aq(o, af aq(p, @) R 
(b) a” 20 1+ 2 + ap p’ 


where 2 and & have the significance of (4.31). 
But up to terms of degree 4k + 1 in p we have the two expressions in (4.51) 
equal, respectively, to 


(a) —q@ + (Ah — Bah + 7)", 
(b) A+ (Bh+ Ag(h’ + 7. 
The terms in p“*" in (4.51a) and (4.51b) are 


(4.52) 


(a) Sew + B(—axys1 sin @ + Bes: cos 6) + a’ cos@ + b’ sin é 


+A (aren 41 cos 8 + Box+1 sin 6) (2k + 1) + m'(@), 
(4.53) 
(b) dqars1 


, B + Blaxs: cos @ + Box41 sin @) + c’ cos 6 + d’ sin 6 


+ Alas: sin @ — Bus: cos 6) (2k + 1) + n’(@), 
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while the terms in p“*’ in (4.52a) and (4.52b) are 
(a) —Qsesa + A (eens: COS O + Boxy sin 6) + a” cos 6 + b” sin @ 
(4.54) — Bars: sin 6 — Bors; cos 0) + 2k(A cos 6 — Bsin O@)axs, + m’’(8), 
(b) Rsesr + Boryi(—A cos 6+ B sin 0) + c” cos 6 + d” sin 6 
+ acx4i(B cos 6 + A sin 6)(2k + 1) + n’’(8), 


where sin @, cos 0, a2x+1 , Bes: occur only where explicitly written. 
Now introduce the functions h*(6) and q*(@) satisfying the differential equa- 
tions 


ih* dq* 
(4.55) > = —q* + min4s(6), =a = h* + ninsi(9), 
where 
m’"(6) — m'(0) = mix1(8), 
n'"(0) — n'(0) = niesi(8), 


as written in (4.4). This system (4.55) has a solution h*(@), q*(@) of the re- 
quired type. 
Now, for h4iss(O), Gaxs1(0) of (4.5) write 


haris(0) = h*(0) + uw sin 8, 
darxi(O) = g*(0) + Asin 8, 
and substitute in (4.53a). This becomes 


* 
> + cos 8+ axsa[—B sin 6 + (2k + 1)A cos 6] + a’ cos 6 


+ Db’ sin 6 + Bex:[B cos 6 + (2k + 1)A sin 6] + m’(@) 
= —qg* — Asin 0 + Asin 0 + muss + uw COs 6 
(4.56) + Beriil[B cos 6 + (2k + 1)A sin 6] + a’ cos 6 + DB’ sin 6 
+ axa[—B sin 6 + (2k + 1)A cos 6] + m’(0) = —Garss + m’’(0) 
+ a” cos 0+ b” sin 6 + Alas: COS 6 + Boxy, sin 6) 
— Bax. sin @ — Boxy; cos 6) + QZkaex4:(A cos 6 — B sin @) 
+ cos O(—A4ny1 +) + (—Daegr + WZWBaoey: + ZWABoxs1 + A)sin 8; 
i.e., (4.53a) becomes equal to (4.54a) except for the expression 
(4.561)  (—@aesi + uw) cos O + (—dacgr + 2ZkBars, + ZWABxy, + A)sin ¢. 


Hence the expression (4.51a) equals the expression (4.52a) up to and including 
terms of degree 4k + 1 in p except for (4.561). 

In the same way it is found that the expression (4.51b) equals the expression 
(4.52b) up to and including terms of degree 4k + 1 in p except for the terms 


(4.562) (X —- to. — 2kA Bonss — 2k Bazy..,)cos 6+ (—u _ dgxs)sin 6. 
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Now choose the constants as follows: 
Barks: + ABrkir = (Darr — Cresr)/4k, 
B= 3(—Gaey1 — daes1), 
NK = 3(Dsesar + Carsr), 
whence (4.561) and (4.562) become L cos @ and L sin @ respectively, where 
(4.563) L = 3(—dynss — ges). 


The theorem follows now as in the preceding section. 
Case Il. A=0,B #0. 
The partial differential equations (4.3) now become 


a (1 + OY + ee 9 — Bali? + 0°, 
dp p 


a6 pe 

(4.6) 
“4 (1 + ‘) + aq k =h + Bh(h® + q)*. 
00 p dp p 


Again let the formal solution be (4.311) and let us follow the procedure of 
Case I. If we attempt now to solve the differential equations corresponding to 
(4.35) in h, , gq. , we are led to 


P 
(4.61) pat = —h, + F,siné + (—4kBa,_» + E,) cosé@ + M, 


where the terms here have the significance of (4.36). 
We thus see that a solution of the required type exists provided F, = 0 and 
provided the arbitrary constant is chosen as follows: 


As 27% = E, /AKB. 
However in the general case, for some s = 2p + 1, 
Pou # 0, p>k, 
in which case we have the following theorem: 
Turorem. By a real analytic transformation 
uz W a 4 oe 
v=2+---, 
the differential equations (3.1) are reduced to 
du 
dt 
dy 
at 


—v — Bow’ + v’)* + Au(? + 0)? + ---, 
(4.7) 
u + Bu(u® + v*)* + Ao(u® + 0”)? +... 
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The discussion here is much the same as in the preceding lemmas and will not 
be repeated. 

Let us note a further change which can be introduced both in (4.2) and (4.7). 
It is essentially a change in the parameter ¢ such that 


(4.71) dt = dr/{1 + B(w? + v’)*}. 


The system (4.2) becomes 


- = —v+ Au(u’ + v)* + Lulu’? + o*)* 4+ --., 
T 
(4.72) ; 
= = u + Ao(u® + v*)* + L'o(u® + v*)* 
T 


where L’ = L — AB. The system (4.72) is the form sought under the pre- 
liminary transformations. 
However, the system (4.7) under (4.71) becomes 


i ‘ , 

ee oe —v +Au(u® + 2”)? + .-s, 

dr 

dv 2 2\p 
=u+Ar(u +r)’ +.--, 

dr 


which is in the form (3.1) with A # 0 so that Case I of the theorem in this 
section applies. Thus the ultimate form in each case is (4.72) above. 

Note that after the proper change of parameter, above, has been found, a 
corresponding one could be made in the initial set of differential equations (1.1) 
atonee. Beginning with the set (1.1) so modified no further change in parameter 
is necessary and we arrive at (4.72) under changes in the dependent variables only. 

The set of differential equations (1.1) transform under the 1-1 analytic 
transformations (3) and (4.1) which contain arbitrary constants into a system 
of the above form with unique k, A, B, and L. The system so obtained is 
spoken of as unique. When in addition the change of parameter (4.71) is intro- 
duced, the system attained (4.72) again has a unique set of constants, k, A, LD. 

These statements can be established by observing that if there were a different 
system attained, there would exist a 1-1 analytic transformation taking the 
one system into the other. This leads to a contradiction. 


5. Convergent case. It is of interest to examine what occurs when A; = 0. 
For the case where the functions X(z, y) and Y(z, y) are polynomials, Poincaré’ 
showed that if the partial differential equation 


(5.1) Fi +! s)+ 2k =o 


dp p 


> Poincaré, loc. cit 
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has a formal solution of the required type, that solution is also convergent. 
The existence of this integral of (5.1) implies the existence of analytic solutions 
of (2.2)* so that for these cases there exist analytic functions which accomplish 
the transformation. 

This case can arise in (2.2) or (4.7). In the former the system of differential 
equations becomes 


du dv 
—_ = —v = 4 


dt . dt 


In the case of (4.7) the system becomes 


du tied 2 2yk 
at = v(1 a Blu + u ) ), 
dv ae 2 2yk 
a u(1 + Bw + v°)"). 


6. The formal transformation. If we attempt now to transform the system 
of differential equations (4.2) or (4.72), we find there does exist a formal trans- 
formation of the required type. The method followed in establishing this is 
precisely that of §4; in fact, the partial differential equations here 


oh Q oh R 9 yk 9 ra) 
(1 + :) +22 = ~q + (Ah — Bg) (i? + qt + Lah + 9°), 
00 Op p 


21 (1 + 5) + 0% + (Bh + Ag) (H+ G+ Lal + 0° 
a0 p dp p 

differ but slightly from those of (4.31) and the related expressions corresponding 
to (4.35), (4.36), and (4.37) likewise differ very slightly from them. How- 
ever the term in p“*’ which caused the difficulty before is now taken care of 
by the term involving the constant L. This term also contributes the slight 
differences between the expressions here and the corresponding ones of §4. The 
result is stated in the following theorem, whose proof will not be given: 

Tueorem. The system of real differential equations 


i = —v + (Au — Bo)(w’ + v°)* + Lula’ +o)" + --- = Ua, 0), 
(6.1) 

Fi =u + (Bu + Av) (uw? + v*)* + Lo(u® + 0°)” 4 --- =V(u, »), 

( 


where LU’ and V are the real analytic functions of (4.2), can be transformed to the 
unique (see §4) system 


z = —v1 + (Am — Boy) (ui + oi)" + Lu(uy + 0)*, 
( 
(6.2) 

F = uw + (Buy + Avy) (ui + vi)" + Le(ut + 03)™ 


‘ Birkhoff, Am. Jour. Math., vol. 49 (1927), p. 37. 
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by the formal transformation 
2 2\k i 
Uy = U + (agi + Boyav) (ue +o) + om aur, 
2 2\h i 
U + (Berit + areyiv) (uw? + vo) + SY dyu'v’, 


(6.3) 


Vv) 


where 
i+j>2k+1 
ABorss + Barges, = 0. 
Note that, if desired, the quantity B can be taken equal to zero by virtue of 


the form (4.72), and here also there is but one form for the transformed differ- 
ential equations. 


7. Auxiliary functions. The series (6.3) of the last section, which accomplish 
the desired transformations, are formal and not necessarily convergent. Conse- 
quently these expressions are not satisfactory and we seek more appropriate ones. 

Following Birkhoff,’ introduce the associated series 


— f(u,v) = u + (au + Br) (Ww? + vw) + } ait — elitia; 1cateerit ) u'v’, 
i 


=~ 
g(u,v) = v + (—Bu + av) (uw? + vo) + 7 bi Ai — hi ict erie ) uv’. 
These series are of a type considered by Ritt® and by Birkhoff,’ so that only an 
outline of the proof of the following theorem, which is completely analogous to a 
theorem of Birkhoff’s (loc. cit.), is given here. 
TueoremM. The real functions f(u, v), g(u, v) defined in (7.1) are analytic in 
the real variables u and v for 
0O<v Sut Sp 
and C™ for 
Os uw? +v< Po 


and such that 


itd g | ij 
ai = a,;t!j!, mie, = b,;t!j!. 
In the region of complex space, 
ju +v | S po, 
argu] < oF, jarge| < Zr, 
or 
|r —argu| <<, | —arge| <=, 


5 Birkhoff, Sitzungsberichte Preus. Akad. (1929), pp. 171-183. 
6 J. F. Ritt, Annals of Mathematics, (2), vol. 18 (1916), p. 18. 
7? Birkhoff, loc. cit. 
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we find 
| a —\/}  Qlullol? 
Jasju' — em) | < Ae tel 
ne eo a 
where 
2 2h 
Bi = {1+ i ajl}wtvy. 
We also have 
ju +v| >V2pipe, 
where 
1 a 


i@ i 
u= pie’, v= pre ~ 


For the terms under consideration, 7 + j > 2k, divide the treatment into the 
following three cases: 








(a) i 2k, j 2k; 
then 
2 u|' y |? ‘ - ) J : 
at Li? 6 ete 2! ul! By |F-*, 
| u? + v? |* \u Kiv \é Dk2 ' ' 
(b) i<k, j>mk-i: 
then 
‘ 2} J 9 \j-a 
“|U v o|v © i 2k 
| | | < | | < 2\v} } 


| 2 fe 2 |* = Die] yy? yp? jk 
(ec) j<k, i> 2k —j. 
As in (b), these terms are less than 2| u |". 
Thus we have as a dominating series the expression 


i=2 t=k 
iim” i=k "nema 
| —k j-k | 2h < 2k 
2 jul "fol?" +2 | u | +3 3. je|* 
i=k j=1 i=1 
j=k i=k+1 j=kt+1 


Consequently we have the analyticity of the function in the open region and 
continuity at the origin. 

Let us now consider the partial derivative 4/du of the series. A typical term 
for this differentiation becomes 


—1 —1 ol. 
. i-l j Bea aj; +1 J So; 2k 
104; U r(1 —€ 3) - — 6 vo emi — 
2 y2\k+ 
1+ fay; (u? + 2) 





As above, we have 


" —1 j 
e ae —j\| _, bt rh 
| tay;u' ri _ em) S2jul' * ‘|r| 











TRANSFORMATION OF DIFFERENTIAL EQUATIONS 409 


For the second part of the derivative of the term we find 


=f 
Mi __ yi eittj — = | # 2k | u | * |v Pt, 
1 + |a,;| (u? + y?)*+ 
provided i>k+1, j>k+l. 

The terms for which these inequalities are not satisfied are treated as under 
(b) and (c) above. Thus again we have a dominating series and consequently 
the derived series converges uniformly and absolutely and hence this partial 
derivative has the stated properties of continuity and analyticity. 

Higher derivatives are treated in the same manner. 

The limits approached by the derived series are as stated. The proof merely 
depends on the observation that each term of the derived series a°*’/(a@u‘av’) 
approaches zero with (u” + v”) except the one of the form 


- 
ij! aha _ emi), 
whose limit is i!j!aj;. 


Introduce the variables p, 6 in the functions (7.1), where 
u = pcos 8, v = psin @. 
The functions become 


f(p, 0) = p cos @ + (a cos 6 + B sin @)p* 


1 
> oat —¢ i) cos '# sin 78, 


(7.2) | 
a(p, 0) = psin@ + (asin @ — Bos @)p"' 
+>. ob —- ei) cos 6 sin ’8, 
where 
Ai = (1+ ais |}e", By = (1 + | bis |ie”, 
which are analytic in the open region 
0< pS po 
05608 2r 
and of class C™ in the closed region 
OSps po, 
0 < @ SS 2r. 


Considered as power series in sin 6, cos 6, the expressions (7.2) are uniformly 
and absolutely convergent in the closed region. Furthermore, these functions 
are characterized by the following theorem: 
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THeorem. The functions (7.2) can be expressed as power series in 6: 


(7.21) F(p, 0) = folp) + filo)@ + fale)® + ---, 
(p, 8) = golo) + gr(0)8 + go(p)6* + --- 

convergent for 

oS°a», 0p po, 


where the functions fi(p), gi(p) are analytic for 0 < p < po and of class C” for 
0x v = Po. 
Consider the first expression in (7.2); we have, as seen above, 
| 1 | 
lasso(1 — et) | < agit 
Consequently the series 


D> dip? = 2p'*?™ cosh ‘6 sinh ’@ 


dominates the series obtained by expanding the term 


a“. 
ai;p' H( _ eta) cos ‘6 sin 76 


in powers of @. 
Furthermore, the series 


ao(p) + ay(p)0 + a2(p)0 + --- = 5 2p'*?* cosh ‘é sinh 76 


converges uniformly and absolutely for p < p; sufficiently small and 0 < @ S 27 
and all the a;(p) are positive. Then, since 


| file) | < ai(p), | gi(e) | < ai(p), 


the series (7.21) converge. It is readily shown that the fi(p), gi(oe) have the 
additional properties stated in the theorem. 

A similar discussion holds for the partial derivatives, and we have the second” 

TurorEem. The first partial derivatives of the functions f(p, 0), g(p, 0) exist 
and can be expressed as the convergent series made up of the corresponding partial 
derivatives of the terms of the series (7.21). 

Now introduce new variables a, # by means of (7.1), as follows: 


(7.3) ui = f(u, v), d= g(u, v). 
The variables u, v can be expressed as 
(7.31) u = f*(d, d), v = g*(u, d), 


where the functions f*, g* have the same properties as f(u, v) and g(u, v) of the 
above theorem. 


8 See Bécher, On semi-analytic functions of two variables, Annals of Mathematics, vol. 12 
(1910-11), p. 18. 
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Put i = rcos¢,d = rsing and use (7.2). Then we have 
r cos @ = f(p, 8), 
rsin @ = G(p, @). 


The properties of the inverse transformation, the expressions for p, @ or p, 
sin @ (or cos @), can be established in the following theorems. (The proofs are 
omitted.) 

TuroreM. The system of equations (7.32) can be solved for p, 6 in the form 


(7.32) 


(7.4) 6 =o + rhi(r)¢’, p=r+t =rmi(r)¢' 


convergent for 0 S @ S 27,0 Sr S 1%, where the hi(r) and m,(r) have the properties 
of fi(p), gi(o) in (7.21) above. 

For the second theorem we have: 

THrorEeM. The system of equations (7.32) can be solved for p and sin 6 or p and 
cos 6 in the form 


p=rt > Ahir) cos ‘¢ sin “4, 
sin 6 = sing + > nij(T) COs ‘p sin “6, 


which are uniformly and absolutely convergent considered as power series tn sin ¢, 
cos @, for 


(7.41) 


0 < ¢ S 2, 0 


lA 


r < To, 


where hi(r) and ni;(r) are analytic in r for 0 <r S ro and of class C* forO Sr S 
ro , and are such that 


h;;(O) = 0, nij(O) = 0. 


8. Auxiliary transformation. In the system of differential equations (6.1) put 
u = pcos 6, v = psin @, 


and then introduce the new variables r, ¢@ by means of the transformation (7.32) 


(8.1) recos @ = f(p, 8), rsin@ = G(p, 4), 
whence 
(8.11) r= VP aa 7 = M(p, 6). 


The system becomes 
dr/dt = Ar*“** + Lr“ + R*(r, 9), 


(8.2) “ 
do/dt = 1 + Br* + O*(r, ), 


where the properties of the functions R*(r, ¢) and Q*(r,@) are given in the fol- 
lowing theorems. 
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THeoreM. The function R*(r, @) of (8.2) is analytic in r,¢@ for0 <r Sm, 
0 < ¢ S 2r, of class C” for0 S r S m,0 S ¢ S 2x, and can be written R*(r, ) = 
Ror) + Ri(r)o + --- , which converges for 0 S r S m,0 SF o S 2rz. 

To prove the theorem, let us note the form of R*(r, ¢). Introducing r by 
means of (7.41), we get 


dr _ aM Xp, *) aM (Be ”) 
eas di ~ 00 (1+ rae a > ara a 





an expression in p and 6, where 2 and R arise from U’ and V here, as they do 
from X and Y in (2.31). In terms of r, ¢ this becomes 


(8.31) Ar’! + Lr“*" 4 R*(r, 6), 


where the expression R*(r, @) can be written in terms of p and @ as 


(8.32) (1 + ) eM | ROM _ sy _ LM“* = R*(p, 6), 
p’/ 00 p op 

whence from the theorems of §7, this expression in terms of p and @ has the 
required properties. Consequently, it also has the required properties in terms 
of rand ¢@. Note that where ¢ enters in R*(r, ¢), it can be expressed entirely 
as powers of sin ¢ and cos ¢. 

A similar result holds for Q*(r, ¢). 

A second theorem is as follows: 

TueoreM. For any positive integer p, 
R*(r, o) 


r Pp 


lim = 0. 


r=0 
For the expression in (8.11), write 
(8.4) M(p, 0) = M'(p, 0) + M''(p, 8), 
where M'(p, 6) is made up only of those terms of M(p, 6) which involve the 
terms of (7.2) with coefficients a;; , b;; such that i + 7 S p, while M’'(p, 8) is 


made up of the rest of the expansion of M(p, @). 
Now put 


(8.41) M’(p, 0) = Mi(p, 0) + Mo(p, 8), 
where M,(p, @) is of the form 
> ai; cos '@ sin ’6p'”’ (+) p), 
while M2(p, @) is made up of terms, each of which contains at least one factor 
of the type gli 
We have 





(8.42) mt: + 5) + i = AM? 4 LMM! 4 N(p, 0), 
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where N(p, @) is a power series in p, cos 6, sin @, such that each term is of degree 
greater than p in p. 

Now, for the function R*(p, 6) of (8.32) we have 
2 
is 
— A(M, + M; + M”)**" — L(M, + M + M”)“™ 


R*(p, 6) 


9 (Mm, + Me + M”) (1 re ) 42 (v4 Mm. +M" *® 
06 dp p 


(8.5) = N(p, 0) + AM? + LM} 
— A(M, + M; + M”)** — L(M, + M; + M"”)** 
a " Q a mh 
Consequently, we see 
* 
lim R*(o, 9) = 0, 
p=0 p” 
and thus 
*/.. 
lim z “ ¢) = 0 


r=0 } 


A similar result holds for 2*(r, @). 

Finally introduce into the differential equations the variables a, i of §7. We 
immediately have the following facts: 

Tueorem. (1) Under the transformation (7.3) the system of differential equa- 
tions becomes 


: — + (Aa +- Bd) (a - p)* a Lait + x)* + U(a, d), 
(8.6) ; 
4 = a + (Ba + Ad) + 0) + Low’ + #*)* + Va, 0). 


(2) The functions U(a, 6), V (a, 5) are analytic in the open region 0 < (a + P 
< pi and class C™ in the closed region. 
(3) For any integer p, 





. O(a, d) 

ne Gat + oe 
7=0 

: V(u,o) 
lim ca 4 pty = 9 
7=0 


9. Final transformation. The final transformation is introduced in the 
following 
THreoremM. There exist functions 











414 c. I. LUBIN 


(9.1) fi=ait+---, ?=ov+.--- 


analytic in the open region i + 0° > 0, of class C* in the closed region @” + * = 0 
such that the differential equations (8.6), under the transformation w = fi,z = 1, 
become 


a = —2+ (Aw — Bz) (2 + w’)* + Lule’ + ww’), 
(9.2) : 
Gm wt (Bw + + Az) (2? + w*)* + Le(2? + w’)™. 


To establish this theorem let us first prove the lemma below: 
Lemma. The partial differential equation 


(9.3) (1 + Br™ +.- =) 7 ~ (4 irr" «4 *) ow Ape" 4. oe 
where A, B, L, R*(r, ¢), &*(r, &) have the same significance as in (8.2), has a solution 
(9.31) Fort > F;(r) cos‘ sin’ 
analytic in r for r > 0, of class C* for r = 0 and such that 

lim tt = 0 


for any positive integer p. 
Consider the two cases L = Oand L # 0. 
Casel. L=0 
If L = 0, the partial differential equation (9.3) becomes 


F * 
(9.4) oF (1+ Bi +E) 4S (ae 4B) = ae 
dg or r 


Make the substitution 
(9.41) f= 


in the partial differential —- (9.4). We get 


4 2k+ R* 2kk* 
(9.42) £1 4 Br +- )+ L(A, we) sao 
and f = 1+ ---, where the part of the expression for f which is omitted 


vanishes with r to a power higher than the first. 
Now introduce the variables 


—_— s log r 
QkAr*®  A’” 


p=". 


(9.43) "vees 
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The partial differential equation (9.4) becomes 


af (a* R*(1 + Bp“) af/,wn, RB 2kR* 
(9.44) ae - a —\Ap +— —— f=9. 
av Vp Apt > ap , + > pe f 

The expressions R*(r, ¢), 2*(r, ¢) above become, in terms of p, ¥, R*(p, W), 
Q*(p, Y). These expressions have the same properties in terms of p and y that 
the original expressions, R*, 2*, have in terms of r and ¢. 

Finally write (9.44) as 


af —2kR* 
(9.45) . a i SE 
dp 2k+2 2k+1 ow tl ok+1 R* oy 
p \Ap” + — Ap : 


p 








where f(0, y) = 1. 
Now let us consider the other case. 
Case2. L # 0. 
Here make in (9.3) the substitution 


l L | l Al ] L | 1 «| 
9! F i Res ae oe i ac ee oe oe 4 
(9.5) A log u Pa + — log Pe =o += log xz + 


The partial differential equation becomes 











2k 2 
2 du 2k o* Ou 2k+1 4k+1 R* Tr. 
9 — Z a —itAr wv — = a 
(9.51) (14 B + a) + 2H (ay +h + ) Arh 4 Dr’ 
where u = 1 + ---. Introduce in (9.51) the new variables 
AB-L, A+L” . 1) 1 
9.! = =n ent om) sans 
( 52) ¥ co) + { 4 log yok + 4} QA ? 
p=T, 
whence 
_ Re 
au vo 1 - 2k > uUu 
dp ip 4 Tpit 4 R* Ap’ + Lp"* 
(9.53) p . 





du [ o* 1+ Bp“ R* 
rw} ro ma Ap*™*! + Lp#*t 77 ’ 


where 9*(p, ¥), R*(p, y) arise from Q*(r, ¢), R*(r, ¢). 

By virtue of the lemma below, each of the partial differential equations (9.45) 
and (9.53) has a solution in terms of cos y, sin y, p of the form (9.71) below, 
convergent for 
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pS po, 0 Sy S 2r, 


where the coefficients of the cos ‘y sin °y have the property (9.72) and where 
u(0, y) = 1. 

Returning to the variables r, ¢@ by means of (9.43) or (9.52), the corresponding 
function f(r, ¢) is also of the form (9.71) convergent for r S ro, 0 S @ S 2z. 

Finally introduce the function F(r, ¢) by means of (9.5) or (9.4). We find 
in both cases that this function, a solution of (9.3), likewise can be expressed 
in the form (9.71) with the above properties. Consequently we have the lemma 
established. 

If the same transformation (9.43) or (9.52) is made in the partial differential 
equation (9.6) the following lemma can be established: 

LemMa. The partial differential equation 


aG ac 


* ’ * 
oe se ( + Br 4 ©) + i(ae + Ltt 4 = 


/ 


yok 
> ) =1+4+ BI 


has a solution 
G=¢o+ y» gi; COS '@ sin “ 
analytic in r for r > 0, of class C* in r for r = 0 and such that 


lim Z/ = 0 


“Pp ’ 
r=0 r? 


p any integer. 

Now writing w = f; = F cos G,z = g; = F sin G, and expressing F and G in 
terms of u and v by u = r cos ¢,v = rsin ¢, the theorem of this section can be 
established. 

Finally, the lemma appealed to above can be stated as follows: 

Lemma. In the partial differential equation 


(9.7) a = S(p, 0)f + T(, 6) 2 
let the functions S(p, 6), T(p, 0) be analytic in p and 6, p > 0 class C*, p = 0 ex- 
pressible as 
on (a) S(p, 0) = D> Si(p)6’, 
(b) S(p, 0) = >> Si(p) cos 8 sin’6 


both uniformly convergent for po 2 p 2 0 and such that 


. S; . S; 
(9.72) lim Silo) = 0, lim ife) = 0 


» 
p=0 pi p=0 =p 


for any integer p, with similar properties for T(p, 0). Then there exists a function 
S(p, 0), analytic in p and 6, p > 0, of class C*, p = 0, expressible as S(p, 0) is above, 
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(9.71) and (9.72), and satisfying the partial differential equation (9.7) and 
f(O0, 0) = 1. 
This lemma can be established by setting up the sequence 


fi=1, 


(9.73) . 3 f.) 
fin = 1+ i {St @)fi + T(p, 0) => dp. 

0 30 | 
The functions S and T are expressed as under (a) above to prove the conver- 
gence. The proof follows in the way used for ordinary differential equations.’ 


10. Conclusion. Combining the transformations of §8 and §9, we take the 
system of differential equations (6.1) into the final form (9.2) using a trans- 
formation analytic in the open region, x” + y’ > 0, and of class C* in the closed 
region x’ + y’ = 0 for (2° + y’) not greater than, say, d’. 

As indicated above, the form (6.1) is attained by an analytic transformation 
from the initial form (1.1) or by an analytic transformation and the change of 
parameter indicated in §4. 

Sonsequently the final form of the system of differential equations (9.2) is 
attained from the initial system (1.1) by a transformation analytic in the open 
region, of class C” in the closed region including the origin, or by such a trans- 
formation and a change of parameter. 


UNIVERSITY OF CINCINNATI. 


® Picard, Traité d’ Analyse, vol. 2, 1925, p. 368. 














THE MINIMA OF FUNCTIONALS WITH ASSOCIATED SIDE 
CONDITIONS 


By Herman H. Goupstine 


In his doctoral dissertation the author obtained a generalization of the cal- 
culus of variations which does not include either the problem of Lagrange with 
fixed end-points or the more general problem of Bolza. That is to say, the 
theory therein presented includes only the ordinary problem of the calculus 
of variations and certain non-calculus of variations problems which have fixed 
end-points and no side conditions.’ To remedy this defect a more general 
situation is considered in the present paper; more specifically it is proposed to 
find conditions that a functional having certain differentiability properties be a 
minimum in a class of functions satisfying a system of integro-differential equa- 
tions and passing through two fixed points. 

This problem is transformed into one having only generalized end conditions 
and is then treated by a method which is a generalization of the technique 
adopted in the author’s thesis. Analogues of the Lagrange multiplier rule, the 
Clebsch condition, and the Jacobi-Mayer condition are obtained for the trans- 
formed problem. 

The analogue of the Jacobi-Mayer condition is especially interesting, since 
the theory of the fixed end-point problem of Lagrange with integro-differential 
side conditions does not contain such a condition.” Moreover this condition 
does not reduce to the ordinary Jacobi condition for the simple problem of the 
calculus of variations, as has been shown.® 


1. The problems and the transformation. We shall start with the following 
problem: to find necessary conditions that an are 


bs = &(s) (i =1,--+,n;0Ss1) 


minimize a functional J in the class of ares satisfying the integro-differential 
conditions 


Received October 26, 1936. 

1 Conditions for a minimum of a functional, Chicago Doctoral Dissertation (1936); it is 
expected that this paper will soon appear in the third volume of the Contributions to the 
Calculus of Variations (Chicago). In the subsequent footnotes it will be indicated as 
Paper I. 

? See, e.g., L. M. Graves, A transformation of the problem of Lagrange in the calculus of 
variations, Transactions of the American Mathematical Society, vol. 35 (1933), pp. 675-682. 

3See Paper I, pp. 32-37. 
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alz, E(x), u(x)| = 0 (a = 1,-*+,m <n), 
(1) * 
u,(z) = [ P,[z, s, &(s)] ds (y=nt+1,---,n+4q), 
0 
(2) u;(x) =a;+ [ £,(s) ds (i wm ]i---,. n) 
0 
and the end conditions u;(1) = b; (¢ = 1,---, n). The functions ¢, are 


assumed to have continuous partial deriv: om es of the second order with respect 
to all their arguments in a given region 9; of (2n + q + 1)-dimensional space. 
The functions P, are supposed to possess the same differentiability properties 
as the functions g, on a region 3, of (n + 2)-dimensional space. The curves 
to be discussed belong to a certain region @, of the space ©, of n-uples of 
continuous functions defined on 0 S x S 1. Moreover on this region Go, 
the functional J has a second differential. 

The region 2 of (2n + q = 2)-dimensional space is, by definition, the set of 
all points (x, 8, yi, <** 5 Yns Us °° * 5 Ung 4) for which the point (x, 8,91, °°" 5 Ya) 
is in JX» and the point i a. 2+ Yn, Uy", Unsg) is in MN. Then the 
region ©, has the property that the point 


[x, s, &(s), u(s)] (0<sS2 21) 
is interior to the region 3 whenever the element (&, --+ , &) is in @,. Lastly 
along the minimizing “curve” 

for = fos(s), Us = U,(s) (@=1,---,n;6 =1,---,n+4q), 
the matrix of partial derivatives (@ay;) (@ = 1,--:,m;t = 1,---, m) has 
rank m. 


The second problem to be treated is the following: to find necessary condi- 
tions that an element £ = (&:, --+, &,-), lying in a certain region @, of the 


space of r-uples of continuous functions defined on 0 S x S 1 minimize a func- 
tional J in a class of elements satisfying functional bate Stl 
A,(&) = 0 (u = 1, aide » P)- 


The functionals J and A, are supposed to have second differentials at each 
point of Gp, . 

It is simple to verify that the transformation adopted by Graves in treating 
the problem of Lagrange’ carries the first problem into a special case of the 
second; moreover the transformation is such that necessary conditions for one 
problem are necessary for the other. 


* For definitions of the terms used above see, e.g., L. M. Graves, Topics in the functional 
calculus, Bulletin of the American Mathematical Society, vol. 41 (1935), pp. 641-662; 
T. H. Hildebrandt and L. M. Graves, Implicit functions and their differentials in general 
analysis, vol. 29 (1927), pp. 127-153; or Paper I. 

5 A transformation of the preblem of Lagrange in the calculus of variations, loc. cit. 
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However, before proceeding to a treatment of the latter problem, we shall 
show that it can be transformed into an equivalent problem which is nota- 
tionally simpler to analyse. To effect this transformation we make the fol- 
lowing definitions: 


(3) £(x) E(x —i +1) G-—-ls2z<t;t=1,---,r), 
t(r) = £(1), 
(4) K(é) = J ((é: a é,)], B,(é) = A, [(& “Se é,)] (u = 1, rae Pp), 


where é is given by equations (3). Then the domain of K and B, is a region 
© of the space of functions — which are defined on 0 S x S r and are con- 
tinuous except possibly at r = 7 (¢ = 1, --- ,r — 1), where they have finite 
right and left hand limits; we define the norm of an element £ to be the largest 
of the norms of its sections,° whence K and B, have second differentials on this 
region. 

Making use of the Riesz theorem’ and the differentiability of K, one can 
verify that there exist regular functions « and 8, of limited variation® such 
that «(0) = 8,(0) = 0 (uw = 1,---, p), « and 8, are continuous at rx = 7 
(ij = 1,---,r— 1), and 


(5) dK(i;¢) = [vw dx(xr), dB,(&;¢) = [se dB,(r) (u=1,---,p) 


for every admissible variation ¢, i.e., for every function ¢ whose sections are 
continuous functions. 


2. The multiplier rule.’ Since K is a minimum at & in the class of all ¢ 
in @ satisfying the equations B,(~) = 0, the usual considerations of the cal- 
culus of variations suffice to show that there exist constants lb, ¢. +--+ ,¢ 
not all zero, such that 


ps 


lo dK (éo; 5) + c, dB, (S36) = 0 
for every admissible variation ¢. Hence defining A(x) to be Iox(r) + ¢,8,(x), 
we have 
(6) [ (x) dX(x) = 0, 
from which follows 
6 By definition, the i-th section £; of Eis &(x), where i — 1 S x < i, and £,(i) = - é(x); 


hence each section of £ is a continuous function on a finite and closed interval. 
7 See, e.g., S. Banach, Théorie des Opérations Linéaires, Warsaw, 1932, pp. 59 ff. 
* A function f of limited variation will be said to be regular in case 


S(z) = [fe - 0) + f(z + 0))/2; 


every function of limited variation can be so regularized. 
® See, e.g., H. Hahn, Ueber die Lagrange’ sche Multiplikatorenmethode, Sitzungsbericht« 
der Akademie Wien, vol. 131 (1922), pp. 531-550. 
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THe Muctipiier Rue. If the function f(x) minimizes the functional K in 
the class of all functions & in G which satisfy the equations 


B,(é) = 0, 
then there exist constants ly, ¢,, -+- ,¢,, not all zero, such that 
(7) Max) = x(x) + ¢,8,(2) = 0 (0s 27sr), 


where x and B, are the functions appearing in equations (5). 

To prove the theorem it suffices to note that equation (6) implies the vanish- 
ing of \ at = r and to use the method indicated in the author’s thesis.” 

An element & in @ will be said to be normal in case there exist p admissible 
variations ¢, (¢ = 1, +--+, p) such that the determinant | dB,(&; ¢) | (u, ¢ = 
1, --- , p) does not vanish. By a proof similar to the one made in the calculus 
of variations it can be shown that an element ¢ is normal if and only if it has 
no set of multipliers l = 0, ¢ , +--+ ,¢,, with which it satisfies equation (7). 
As usual we can take ) = 1 for a normal element and hence uniquely determine 
the remaining multipliers. 


3. The second differential. The usual considerations show that, if our mini- 
mizing element £ is normal, then dK(&; ¢) = 0 and dK(&; ¢, ©) 2 O for 
every admissible variation satisfying 


(8) dB,(é; 5) = 0 (u = 1,---, p). 
We find it convenient to impose a restriction on the functionals B, . 


(H.1) The matrix (dB,(é; 2°')) (uo = 1, -:-, p) is of rank p. 

Therefore the vectors 
(9) dB,(& 3 1), dBu(é03 2), - +> , dBy( eo; x” ') (u = 1, +++, p) 
are linearly independent, and £ is a normal element. Then there is no loss of 
generality in supposing that 
(10) 8,(0) = 0, dB,(& ; 2°) = bye (u, o=1,°:: Pp), 
where (6,.) is the Kronecker delta. Since the vectors (9) are linearly inde- 
pendent, there exists a non-singular matrix (¢,,) such that 

Oye = Cyj dB i(& ; "ite" 

Then the functionals C, = c,;B; have the desired properties; the system of 
equations dC,(&; ¢) = O has the same solutions as the system dB,(é; 6) = 0, 
and dC,(f; 2°’) = by. It will be supposed throughout the sequel that the 
system (8) has been put into this canonical form. Next it will be shown that 

Lemma 1. If d’K(&; v, v) = 0 for every function v whose sections have a 


continuous p-th derivative and which satisfies dB,(& ; v) = 0, then d°K(& 3 ¢,¢) 20 
for every admissible variation ¢ satisfying dB,(& ; ¢) = 0. 


'© Paper I, pp. 21 ff. 
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Suppose that there is an admissible variation ¢) for which the lemma is 
false. Then applying the Weierstrass approximation theorem to the sections 
of {> , we get a sequence of functions {v,} whose sections have continuous p-th 
derivatives and which converges in the sense of the norm to {). Let ™,(r) = 
vn(t) + Cin + Cond + +++ + Cypnz”. Then for each n and each set Cin, *** . Cpn 
of constants, , has the desired differentiability properties. Furthermore by 
choosing the ¢,, -** , Cyn 80 that ¢un = —dB,(£ ; vn)/dB,( ; x“ "), we see that v1, 
satisfies dB,(£ ; v) = 0 for every n. Moreover, since {» satisfies these equations, 
it is evident that for each » the sequence c,, converges to zero; 1, converges in 
the sense of the norm to {). This leads to a contradiction.” 

We proceed to a determination of the second differential. It can be seen 
from the proofs of Fréchet” that there exists a symmetric function k(x, y) of 
limited variation in the sense of Fréchet™ such that k(z, 0) = 0 = k(0, y) 
(0S 2s r;0 S y Sr), k is continuous in both variables together along the 








lines x = i, y = j (i,j = 1, --- , r — 1), and for every admissible ¢, and ¢:"* 
(11) K(k Sia f2) = I [ oi(x)fo(y)d-d, k(a, y), 
Then adopting the method of the author’s thesis,’” we define functions k, : 
' a on ny Biz) Bly) one, Bly) _ py Bil) 
ky(x, y) = k(x, y) + k(r, r) Anan) k(2, r) 3i(r) k(r, y) Br)’ 


k(x, y) = ds I k;_a(s, t)dt 
0 0 


er r | (x _ a *Bilsdds [ (y a s)’ *8,(s)ds 
> | as [ kjals, t) dt - - . _ 
0 0 


[ "(r — 8) *BAs)de [ “(r — 8)*BAa)ds 


z r [ (y — s)’*B\(s)ds 
- I as [ k;-1(s, t)dt —— 
0 0 


[ (r — s)**B(s)ds 


[ (x — s)’ *B,(s)ds 


r y 
-[ as [ k;_s(s, tat — —_——— : 
’ , [ (r — s)**B\(s)ds 
0 











1 Paper I, pp. 11 ff. 

2 Sur les fonctionelles bilinéaires, Transactions of the American Mathematical Society, 
vol. 16 (1915), pp. 215-244. 

18 Tbid., pp. 223 ff. 

4 See, e.g., Paper I, pp. 4 ff. The continuity property stated above results from the 
arbitrariness with which the values of k may be chosen. 

8 See Paper I, pp. 6-8. 
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Then the functions k, (u = 1, --- , p) are symmetric, vanish on the boundary 
of the square (0 < x S r;0 S y S 1); kh; is bounded, Riemann integrable,” 
and continuous on the lines g = 7, y = j (¢, j = 1,---,r — 1). Moreover 


by successively integrating by parts,” and using equations (9) and (10), we 
find that 


(12) &K(& ;V1,¥2) = i [ vi” (x)kp(a, y)vs”’ (y)dady (vy, in MN, » in N), 
o Jo 


where 9 is the class of all admissible variations v satisfying equations (8), such 
that the sections of v have continuous p-th derivatives. 

We find it convenient to make a second restriction on the generality of our 
problem. 

(H.2) If p = 1, it ts assumed that k, is continuous in both variables together. 


4. The Jacobi-Mayer condition. We proceed to establish the 
ANALOGUE OF THE JACOBI-MayerR ConpiTion. [If & is a normal minimizing 
element for K, then the integral equation 


(13) | “Keele, yely)dy = o¢(zx), 


where k,, is the unique continuous function appearing in equation (12), can have 
no negative characteristic values. Moreover if d°K(&; ¢, ¢) is not equal to zero for 
every admissible variation satisfying dB,(%;£) = 0 (u = 1, --- , p), then equation 
(13) always has at least one non-zero characteristic value. 

We prove first that k, is unique. Let x and x be two continuous functions 
effective in equation (12) and let x = x, — x. Then 


I [ v\”(x)x(x, y)vi? (y)dxdy = 0 (vy in MN; ve in M). 
0 0 
Consider any two continuous functions 


vi(z), ga); 


then define two functions »;(z), »2(x) as follows: 


z Est Zp-1 
(14) v(x) = [ [ ra [ gi(x,)dz, -++ day+ Cpst” tees $C15t + Cos 
0 Jo 0 
((=1,2;0<s27 <r). 


‘6 Tbhid., pp. 8-10. 

17 Thid., p. 11. 

18 Thid., pp. 23-24, 26-41. In this reference the condition above is obtained for the 
case r = p = 1, 6:(z) = 2; the result is interpreted for two special cases, one of which is 
the fixed end-point problem of the calculus of variations. 
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We can determine the numbers ¢0,;, «+ - , ¢p—1,; 80 that they satisfy the equations 


[ [ [ eee [ y:(x,)d Cp eee dx, dB, (x) + Cc 1:,4B (é a") 
0 0 ust 
(i _— .. 2; p = 1, Cte p). 


Then both »; and v2 are in %. Hence we have proved that 


[ [ evanc, y)go(y)drdy = 0 


for every pair (¢:, g2) of continuous functions; this implies that « is iden- 
tically zero. 

Suppose next that o is a negative characteristic value for equation (13). 
Then in (14) replace yg; by ¢, a normed characteristic solution corresponding 
too. Hence we have defined a function v in % such that 


l k(x, ye? (ydy = ov'”’(y), [ [vy ’(x)]? dx = 1. 
0 Jo 


This leads immediately to a contradiction. 
Finally if k, is identically zero, d°K(& ; v, v) = 0 for every v in N, whence 
by a method analogous to that employed in Lemma | we find that 


@K(&:¢,o) = 0 


for every admissible variation satisfying equations (8). Consequently the last 
statement in the condition follows immediately from the symmetry of k, and 
the Hilbert-Schmidt theory. 

Corouiary 1. If & is a normal minimizing element for K, then 


coat, 2) =O (n=1,2,---;0s 2S Pr), 
where k,,.(a, y) is the n-th iterated kernel for k,(x, y). 
The proof is the same as the one in the author’s thesis.” Apparently neither 


the condition above nor its corollary appear in the literature for any special 
. 20 
case of the theory herein developed. 


5. The Clebsch condition. We are able to state a simple generalization of 
the familiar Clebsch necessary condition. 

ANALOGUE OF THE CLEBSCH ConbiITION. Let & be a normal element which 
minimizes K in the class of all admissible curves satisfying B,(&) = 0, and let 


(15) n(x) = ¥(x 15) (Os xr<r) 


be a Lebesgue square integrable function of x for each admissible variation ¢ satis- 


19 See Paper I, p. 24. 
* 20Te., excepting the case r = p = 1, 8,(x) = x, which appears in the author’s disserta- 
tion. See, e.g., Graves, A transformation of the problem of Lagrange in the calculus of 
variations, loc. cit., p. 675. 
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fying dB,(&; 6) = 0 (u = 1, +--+ , p); further suppose that py is a homogeneous 
functional of § forO0 S x Sr. Then the lower bound B(&) of d’K(& ; ¢, ¢) in the 
class of all admissible variations ¢ such that dB,(& ; ¢) = 0 (u = 1, --+ , p) and 


(16) n(x)dx = 1 
0 


is finite; this number is also the lower bound of d’K(&; v, v) for all v in N satisfying 
equation (16). 

The first part of this theorem follows immediately from the homogeneity 
of y as a function of ¢ and the normality of &. In the proof of Lemma 1 a 
method for constructing sequences of elements in % which converge to any 
admissible ¢ with dB,(& ; £) = 0 was given. By means of this convergence 
property an indirect proof of the latter part of the condition can be made 
readily.” 

In Paper I the author has shown how the finiteness of the bound B(£) is 
equivalent to the Legendre condition in the special case of the ordinary problem 
of the calculus of variations. Further, the condition above in conjunction 
with the Jacobi-Mayer condition implies that the bound B(£) is both finite 
and non-negative. The problem of obtaining sufficient conditions has been 
solved only for the restricted case which is considered in the author’s thesis. 


Tre UNIVERSITY OF CHICAGO. 


2! See Paper I, p. 23. 














ON THE ZEROS OF JACOBI POLYNOMIALS, WITH APPLICATIONS 
By M. 8S. WEBSTER 


Introduction. ‘This paper deals principally with the generalized Jacobi poly- 
nomials 


Jn(x; a, 8) = J,(x) = (1 + 2) (1 — x)" = (a + 2)" — 2] 


(1) = (—1*nt(?" = _ = *) ala; a, 8); 


on(z; a, B) = o, (x) = 2" — S,r™'* + --- (n = 0,1,2, ---), 


defined (except for constant factors) for all real a, 8 as the polynomial solutions 
of the differential equation 


(1 — 2°)Jn() + [a — B — (@ + B)z]J4.(2) 
+ n(in+a+ 6 — 1)J,(x) = 0 (n = 0,1,---). 


(2) 


For arbitrary a, 8 several authors have discussed the number of real zeros of 
J,(x; a, B). Stieltjes [1]' gave a method of finding the number of zeros in the 
intervals (— 2, —1), (—1, 1), (1, ©) but he stated the result only when a, 8 > 0. 
Shibata [2] gave a table for the number of zeros when they are all real and a, 8 
are not negative integers or zero. Lawton [3] gave complete results for the 
closed interval (—1, 1) when n is sufficiently large. The results of Hilbert [4], 
Klein [5], Van Vleck [6], and Hurwitz [7] for the zeros of the hypergeometric 
function may also be applied to Jacobi polynomials. 

Here we find the number of zeros of J, (2; a, 8) inside the intervals (— «, —1), 
(—1, 1), (1, @) and, in addition, at = +1 (from which the number of imagi- 
nary zeros is easily obtained) when a, 8 are arbitrary. The method employed is 
new and other properties of J,(x; a, 8) are developed as well. 

In case a, 8 > 0, the J,(x) form, as is known, an orthogonal system 


(3) [ _ P(x) I n(x) In(x)dx = 0, p(x) = (1 +2)" "(1 — 2)" 
(m # n;m,n = 0,1, --- ). 


Received December 7, 1936; presented to the American Mathematical Society, April 20, 
1935. The author wishes to express his gratitude to Professor J. Shohat for his valuable 
suggestions. 

1 The numbers in brackets refer to the bibliography at the end of the paper. 
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Here the zeros 2;,,(a, 8) = 2, (¢ = 1, 2, ---,m) of J,(x) have the following 
properties: 
"uw —l, Zan —l(n— o); 
—| < Tint < Tin < Z2,n+1 qe < Lawn < Ln+ivnil <4 


We obtain upper and lower bounds for z;,, by the geometrical method which 
E. R. Neumann [8], Winston [9], and W. Hahn [10] used for Laguerre and Her- 
mite polynomials. We further employ a superior method, based on Markoff’s 
theorem [11], which enables us to find better bounds and even asymptotic 
expressions (n — <) for certain z;,,. We then apply these results to the co- 
efficients H;,, in the mechanical quadratures formula 


1 " 1 
™ : fp __ Pa )on(x)dx 
(4) [, p(x)f(x)dx = > inf (tin) + RAP), Hi... = [ Go a rey 


with the characteristic property that 
1 n 
i P(X)Gon a(x) dx == } Hi 5. Gon-( Xin), i.€., R,(Ge,,. i(r)) => 0, 
—1 i=l 


G,(x) denoting an arbitrary polynomial of degree < s. 

We consider also the coefficients H;,,, for Laguerre polynomials. 

Finally, we establish the theorem, recently proved in a different way by 
Meixner [12], that the system of Hermite polynomials is the only case where 
systems of orthogonal and Appell [13] polynomials coincide. 

1. The following relations, being identities in a, 8, remain valid for the gener- 
alized J,(2) (as well as for the orthogonal case), n = 1, 2, ---. 


(5) Ji(z;a, 8) = —n(n+ a+ 8 — 1)J,u(z;a+ 1,8 + 1) 
J.(—2; a, B) = (—1)"J,.(2; B, a); 
Lina, B) = —Zn-i+1,n(B, a) (2 = |, 2, kts n) 
(7) J,(—1; a, B) = 2-nt(" + . = "Y.; Ju(ls B) load (—2)"n1(" + r - '\ 


With Lawton, we let y(z) = (1 + 2)"**"(1 — 2)"**", so that 


(6) 


d" n-l 


T+ 2w@)l = +2) © ye) +n ve), 


In virtue of (1, 2), we obtain by successive differentiation 
(8) [n+ a+ 8 — I][J.(x; @ + 1, 8) — J,(z; @, 8)] = (x — 1)J2(z; a, 8), 


(9) [n(x + 1) + 2a] J,(z; @ + 1, 8) — 2(n + a) J,(z; a, 8) 
= (2* — 1)J4(z; a + 1, 8). 
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By repeated use of (5, 7), we have 


n n 


Jala; a, B) = DIS(-1; 4,8) ZT = YO ye + 0) 


=0 sf i=0 
(10) ~ 
=n! 2, (—1)'2" eka dle ee eS 


Hereafter, r represents a negative integer or zero; [z] = 0 if x < 1, and equals 
the largest positive integer < x if x = 1. Introduce non-negative integers 
P, 7, mo as follows: 


p=([l-—al, g=[1—6]; m=[l1-n-—a-—ASlifnt+at+6 Sl. 


Let N; (j = 1, 2, 3) denote the number of zeros of J,(2; a, 8) (assumed & 0) 
in the intervals (— x <x < —1), (—1 < 2 < 1), (1 < & < @) respectively, 
and K; denote the number of zeros of J,(x; a + 7, 8) (80) in(— ~ <2 < —1), 
where the smallest and largest such zeros (if they exist) are \;, n; respectively. 
Then, Ko = Ni, S ni < —1. By (2, 10), 


(n—i)(n+itatB— Ivit 204+ IG F+ arin = 0 
(i = 0,1,---,n— 1). 
Substituting J,,(2; @, 8) = (1 + x)" “g(x) in (2), we obtain 


(11) 


J,(z;1 — p, 8) = (-0"-pt(" * alc + 1)’ Jn—p(x; 1 + p, B) 


P 

(n 2 p), 
(12) J,(z;a,1—q) = (—-at(" + ‘ x: ') (x — 1)‘ J,_,(23 a, 1 + Q) 

(n 29), 


J.(a;1 — p, 1 — q) = (-1)""(p + ar( ed .) (x + 1)’(x — 1)" 


Jrnralt;l+pi+@ (n2pt4). 


These formulas show the effect produced by the indicated changes in a, 8. 
It is interesting to observe that the orthogonality relation (3) (derived for 
a, 8 > 0) is still true if a, 8 are non-positive integers provided m,n = p + q; 
for example, using (12), we have, by (3), 


1 
(1+ 2x) 7°01 — x) “J,(2; 1 — p, 1 — qg)J,(a; 1 — p, 1 — gdr 
a 


om 1"? m n yf p — As: 
((p + 9)! (, " MM, “s x + x)*(1 — 2)" J n—pq(t; 1 + p, 1 + 9) 


*Jn—p-o(4@3 1 + p,1 + qgdzr = 0 (m,n 2 p+ q). 
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Formulas (2, 6, 10, 12) yield the following properties of J,,(x; a, 8): (i) it = 0 
if, and only if,n+m+at+B=la+p=1,n =p > m; (ii) it has no mul- 
tiple zeros, except possibly at r = +1; (iii) it has a zero of multiplicity p at x = 
—1 if, and only if, a + p = 1, n 2 p; (iv) it has a zero of multiplicity q at 
xz = 1 if, and only if, 8 + q = 1, n 2 q; (v) it has a zero of multiplicity [n — mo] 
at infinity if, and only if, n + m +a+68=1. 

In what follows we may assume n > 0 and J,,(x; @, 8) * 0. 


2. For the further study of zeros of J,(7) we make use of the following lemma 
{10}: 

Lemma 1. If f(x) * 0 satisfies the differential equation r(x)f''(x) + s(x)f'(x) 
+ t(x)f(x) = 0 in a certain interval (c, d), where r(x), s(x), t(x) are continuous 
and r(x)-t(x) < 0, then f(x) can have at most one zero inside (c, d). 

We now consider two cases. 

Casel. n+atBs>1. 

THreorEeM 1. If n + a+ 68 > 1, then Ne = [n — p — gq) and N,, Nz are 
each either 0 or 1. Furthermore, if a + p = 1, n 2 p, then N, = 0; otherwise, 
N,; = min (n, p) (mod 2)” If8+q=1,n =4q, then Nz = 0; otherwise, Ns = 
min (n, g) (mod 2). 

Proof. N;(j = 1,3) is 0 or 1 by Lemma |. 

(i) n Sp. (a) n < porn= panda’r. By (10,11), yivin > 0 ( = O, 
1,---,n — 1). It follows from Descartes’ rule of signs that Ne = N; = 0 
and N,; = n (mod 2). 

(b) n=p,a+p=1. From (12), N; = Ne = N; = 0. 

(ii) n <q. This reduces to (i) by means of (6). 

(iii) n > p,n > q. Hence, n = p+ q. Lawton’s method’ shows N» 
n—p-—gq. (6,10, 11) enable us to complete the proof as in (i). 

Case2. n+a+£82 1. It suffices to illustrate the method for 0 < n + 
+a+68<1,a*r. By Theorem 1, 


~ 


II 


(13) K,,4: is 0 or 1 and = min (n, [p — m — 1)) (mod 2). 


Let 7 be an integer such that 1 SiS m+ 1. If K; 2 1, it follows from (8, 9): 
(i) J,(x; a + 7 — 1, B) and J,(2; a + 7, 8) have no common zero inside (— ~, 
—1), (ii) J,(a; a + i — 1, B) has exactly K; — 1 zeros which separate the K,; 
zeros of J,(x; a + 7, B) inside (— x, —1), (iii) J,(z; a + 7 — 1, 8) has at most 
one zero less than \; and at most one zero inside (y;, —1). Furthermore, making 
use of (1) and of the fact that J,(x) is a polynomial, we get: 


2 ¢ = min (a, b) (mod 2) means c = d (mod 2), where d = min (a, 6). 

3 For n 2 p+ q+ 1, Lawton proved that J,(2; a, 8) has (i) exactly n — p — q zeros 
inside (—1, 1), (ii) a zero of multiplicity p at z = —1 if a + p = 1, (iii) a zero of multi- 
plicity qatz = 1ifg8+q=1. 
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Kiu=Ki+1(i>p), Ki (sp<n+i, |Ki-1| m+isp) 
< 


(1 iSnm+i1 Sn), 


(14) Kiiui=Ki @>p), |Ki-1| (@Sp<n+i), Ki (n +i p) 
(lsiSnm—n+I1,n S nm), 

Kiu=Ki+1>p), Ki Gsptnt+i, |Ki-1| m+isp) 
(mn —-n+2S5i1S5m+1, S m). 


(14) remains valid if K; = 0. By a repeated application of (14) and (13), we 
derive the following results: 

(i) n2m+1. (a)p2z>mt+1. MN isO0orl and = m + 1 — min (n, p) 
(mod 2). (b)p<m+1. M=m—pt+l. 

(ii) n Sm. Ny = [n — pl. 

We summarize our conclusions in 

THEeorEM 2. (i)0<n+m+atB8<l. [fn 2m+1,m+1 < pand 
a#r,ora+p=1,n < p, then N, is 0 or 1 and = m + 1 — min (a, p) 
(mod 2). [fn 2p >m+lat+p=1,thnN,=0. Ifn 2m+1 2p, 
thnNi=nm—pt+l. IfnSm,thenN,=[n—pl. (ii)n+mt+at+ 8 = 
1. Here, N; = [min (n, m) — pl]. 

THEOREM 3. Ifn+a+ 8 S 1, then NeisO0or 1. 

Proof. lin + a+ B <1, this follows from Lemma 1; if n + a + 6 = 1, 
we use (2). 

Analogous results regarding N3 are obtained (see (6)) by interchanging a and 
8,pandgq. Theorem 2 and the corresponding one for N3 yield 

Corotuary. (i) Jf 0 < n+ m+ a+ 8B < 1, tt is impossible to have both 
Ni > 1, Ns > 1; if, in addition, a = B, then Ni = N3 = 1. (ii) Ifn + m+ 
a+ B = 1, then at least one of the numbers N, , N; is zero; if, in addition, a = 8B, 
then N, = Nz = 0. 

Ifn + a+ 8 < 1, we have determined the number of real zeros of J,,(2; a, 8), 
which do not lie inside (—1, 1). Hence, we may find Nz from Theorem 3, 
since the total number of real zeros is even or odd according as n is even or odd. 
Illustrations, such as the one below, show that the conclusions of the above 
theorems and corollary can not be improved. We note that these results differ 
from the case of Laguerre polynomials [10] in that we may have either N, or N; 
greater than unity. 


Illustration: n = 3, a = 7 B=- Here, p = 1, g = 4, m = 2, 


J3(x) = = {3°-5(x + 1)° + 2-3°-5°(2 + 1)* + 2-3°-5(a + 1) — 2-3-5}. 


Since the discriminant of the equation J/3;(x) = 0 is positive, all of its roots are 
real; thus, N; = 2, Ne = 1, Ns = 0, in accordance with the preceding theorems. 
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3. Hereafter, our analysis is confined to the study of the zeros z;,, (all inside 
(—1, 1)) of J,(2; a, 8) for the case a, 8 > 0. It is known [11] that 
Zinla, B) < Linla + 1, B) < Fissn(a, B) (j= 1,2,---,n— 1); 
furthermore, by (8, 9), 
(n + a + B)(x + 1)Jn(z; @ + 2, B) = [(2n + a + B)(x + 1) + 2a] 
-J,(z; a + 1, 8) — 2(n + a@)Jn(2; @, B). 
Thus, we obtain the following interesting inequalities, to be used later: 
Linla, B) < Lin(a + 1, B) < ainla + 2, B) < Zissn(a, 8), 


(16) ty nla, B) < List.n(@, B+ 2) < Lisa.n(a, B+ 1) < Tisin(@, 8), 
(¢ = 1,2,---,n — 1). 


(15) 





By means of (10), we see that z,,,(a, 8) — —1 + “ig ; z if B— «, he 
k (= 0, finite or infinite). In particular, fork = <, 
Xinla, B) life « (i = 1,2,---,n). 


An interesting relation for the z;,, may be obtained by putting ¢,(z; a, 8) = 
o,(x) = x” +--+, where 


1 
[ z* (1 — x)” "$,.(x)o,(x)dx = 0 (m ~ n; m,n = 0,1, ---). 
0 


If we make the substitution x = (1 + y)/2, thus reducing the interval (0, 1) to 
(—1, 1), and compare with the orthogonality relation (3) for ¢,(2; a, 8), we find 
that ¢,(z) and ¢,[(1 + 2z)/2] differ by a constant factor only. Thus 


1+.) _ _ 1+ Zin 
2 P — 








$n(2) = 2°3.( Lin 5) ’ 


where Z;,, = Z;,.(a, 8) are the zeros of $,(z; a, 8). Making use of the known 
relations [11]: 
ai on(B, B) = Ei_n,n(3, B) (n + 1 < 1 < 2n), 
a7 2n41(B, B) = Fin-1.n3, 8) (n+2 571 5 2n+4 1), 
we get 
on 1 + Xi,n(}, B) = 22, +:.2n(8, B), 1 + Zi,n(2, B) = 22? 4:+1.00418, B) 
(17) 
(7 = 1, 2, aes »n), 
interesting relations connecting the symmetric case (a = 8) with two particular 
non-symmetric cases corresponding to the same interval (—1, 1). We shall 


make important use of (17). 
With W. Hahn, let 
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= 1 = 1 1 Pi 
ae ~~ oo ee er. 
mS ita,’ em? 0+ a’ —r Laces P2 
Evaluating p; , pe by means of (10) and using (6), we get 
2a 2a(a + 1) 
a a F . Bs : ; 
nnta+B—1)~ +" <nin+a+B—1)+ala+6)’ 
28 28(6 + 1) 
18) ———_—_——, < 1-12. < ———————_ |; . 
( ) Intets-D F n(n +a+ 6 — 1) + Bla + B) 
Lin = —1 + o(4), Lin = 1 oa o(*) (a, B > 0, n- 20). 


These results may be greatly improved for some special a, 8 (see (32))." We 
note also that 


2 
1+ fn = | o> - 1) + a(t) (a + 0), 
2 
1 — Lian = Al a + a-t) a w(t) (B — 0), 
r1,n(0, B) = —1, Tninl@, 0) = |] 


(see (12)). 
From (17) and (18) we obtain 
na fe 
—s v2 < —2n.gm = Fniton < = 3 ees 
2+/m(m + a — 3) 2Vm(m + a — 3) + Ha + 4) 


(19) 
aos ——— v6 = < — Lm .2m+1 => Limi 2.Qm41 < = a ae 
2+/m(m + a + 4) . 2 m(m + a + 3) + Sa + 3) 
(a=8>0). 


In some cases these results may be improved (see (29)). 
4. We proceed along the lines of E. R. Neumann and Winston to obtain 
bounds for the general z;,,. (5, 6, 8, 9) yield 


a Tae) = _(2n tet B)(x + 20) Ji(x) — (2n + a + BJ, (x) 


- J nai(t) = . 
(29) "+! nta+p—-1 
(« al a es. ) 
~ *+e+8;" 
2 
(21) (2n + «+e r) Jia(z) = —(Qn+ a+ B)(zr + 2o)J nsi(x) 


— 4(n + a)(n + B)J,(2), 


4 It is to be noted that these results were obtained before the publication of Buell’s [14] 
paper which supplements but does not supplant formulas (18), (19), ete. Formulas (31), 
(32), ete. were not given by Buell. 
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Jnu(z;a— 1,8 — 1) = (1 — 2) 1(a; a, B) 
+ la — B + (a + B — 2)2) J (2; a, 8). 


Let ui..(a@, 8) = win (@ = 1,2, ---,n — 1) be the zeros of J,(z; a, 8). In view 
of (5), it is known that 


win(a, B) = Linala + 1,8 + 1), a < { ~ } < rg 


(22) 


Mi-1,n) Mi,n+l Mi,n 


where either of the terms in each bracket may be used. It may be shown by 
induction on ¢ that Jn(2i.ns1)-Jnsa(ti.n) < 0, so that (20), (21) give 


= . = 
Lin S Ming ACCOrdiNg as Xin = —Zo, 


> : s 
Litinit S Min ACCOTAINE AS Zizi nus > Lo. 
We see from (2) that there is no point of inflection of the curve y = J,(z) inside 
(2i,n, Hin) if Tin S at and no such point inside (ui.n, £é41,n) if ti. 2 — B 
a 


a+, 
By (20), (21), (22), let 
I, = (Qn + a + B) [ sic J (addr 


= n+a+B sa 1 : a 
= (n + 1y(n gers Ras 2) [J naa tin) J nt (2is1.n)], 


n+a+8-1 ; " 
™ 93 (n + 1)(n + a +8, a 2) | J ni (25,0) |. 
‘Ifa. 8 a—Bp then Th) is less than the area of the triangle formed 


~ a+,’ 2n+a+B 
by the z-axis, the tangent to y = J,(x) at = 2;,,, and the perpendicular 
to the z-axis at x = x;,;,, , so that by (20, 21), 





age Ao a = S00) | FCs) 
_ (n+a+B — 1)(ri41. — Xivn) | 
2 ~2(1 — 2?) pane 
us , = § 
Hence { similarly if x; =“ ), 
ence (simi arly if x;, = sae 
a 2 = tin) ' ee —— 
a Strtie = Sie > Fe 1)(n cio,» | ; iat a+ 8’ 
ee | 20 — Tis1.n) ii fun 2 = =. 
i+l,n nm (n+ 1l)(n+ a+ 8 — 2) . _— a+, 
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The inequalities (23) lead to the following bounds for 2;,, : 





. (i +2Ve2—-1)° af asp a 

oy FV Fat B-D ax’ “eae <=" 

(n—i + 2V/p2)° ap a—Bp ‘ 

bates Cm eae eae, te oe, > 3. 
in ss~ ee eetete-8 a Oe 


, 2. ‘ 2 
a, = min (1,22, , 6; = min (1, * ), a. = max (a,w), 


B. = max (8, w), w= ( + v3) = 0.364 ---. 


It suffices to outline the proof for the first inequality (24), when a 2 8. 





with 0 < 28/(a + 8B) < 1. Hence, by (23), 


meiaas) 21 + ain) 28} 
(1 + Tist.n) (1 + Lin) > E + I(nt+at+ B— 9)" a a+ e |. 


Casel. a Sw. Since x; < —— B it follows that 1 — 2,,, 2 28/(a@ + B) 
a+p 





eu >at Vin, a ett int ina +8— Bl + m,0- 





28 
Following E. R. Neumann, we prove (by induction) that 
p> a= (+ 2vVe— =, 
4 
at least ifn 23. Thus, 
‘eae See Se > (n = 3)., 


4(n + Dn+a+B—-2) atB 
Case 2. 0 <a <w. Here, we prove that 


pi =<. o+6 i, +I(in+a+8-—2)142;,,) >o = =O +2Vo— 1" 
a 28 4 
G@+2Ve-1) a 2B (n > 3) 

4(n+1)n+a+B6—2) wo at+B psioats 


Continuing with E. R. Neumann’s method, we may, by means of a com- 
plicated analysis, find certain upper and lower bounds for the negative z;,, in 
the symmetric case (a = 8) as well as in the non-symmetric case. Instead of 
giving these results, we proceed to develop a method which yields much more. 





1 + Tin = 


5. Since 2;,n(a, 8) for a, 8 = 1/2, 3/2 are known [11] explicitly, Markoff’s 
theorem® immediately gives 


id OZin Oz, ’ Ox: ” 
(a, B) >0, — a(a, 6) <0 (a #8); - aoe 8 > 2 0 according as Z;,» 5S 0. 





Oa 0g 
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2-1 . 
<~-+,. < —cos < <3: an a 
on i rs Xi,n(a, B) = Os on vr (3 = a,B => 35;t 1,2, ,n), 


eit + (5). 


We may generalize this result for all a, 8 by means of (16), Markoff’s theorem, 
and [11] 


(26) T1,n4i(a@, B) < T1,n(a@, B + 1), Tn n(a@ + 1, B) < Tnzinqa(@, B). 


For example, if a, 8 = 1/2, there exist non-negative integers r, s such that 


(25) 


ti.n(a, B) = —cos 5 





} Sa—2r—o% <i, $= 8—2-—-a<} (01, 2 = 0, 1). 
Ifs+1<i1sin-—r-—1, then 
Ti-s-1n(a — 2r — 01, B — 2s — a2) S Xj-s-1,n(a, B — 2s — ae) 
< ti-sn(a, 8B — 28) S Xin(a, B) S Tisrnla — 2, B) < Lirest.nla — 2r — 04, B) 
SZ Lisrstn(a — 2r — a1, B — 2s — an). 
Hence, for all a, 8 = 1/2, 


2 on bot < 2x;,,(a,8) < ngage + 2r +2 


“ Ber! auW— oe o 


(} Sa — 2r—a1 < 3,3 SB — 2s — a2 < 33.01, 02 = 0,1; 





s+t+l<itisn-r-l). 
It follows that 


(28) Xin(a, B) — —cos tr if - —tasn—-« (a, 8 > 0). 
Markoff’s theorem and (16) give, further, 


_ V2 < 
r m+1,2m m- m\3,2) = s < 
2im +a -%) < Zm+1, (a, a)<s Tm+1,2 (3 3) in Im : 9 


Rg 
IIV 


. Tv 
sin 4m => Lm+1,2m(3, 3) « Tm+t,2m(Q@, a) < Lm+1,2m(F, a) 


3 3r 
< Im42,2m(4, 4) = sin < — = (0 <a < }), 


(29) . 
=. 9 
2m +at)) 





° TT 
< Lm42,2m41(Q, a) = Lm42,2m41(3, 3) = sin am + i < 2m * i 


T 
n om +1 = Lm42,2m41(3, 3) < Tm42,2m41(a, a) < Lm42,2m+1(}, @) 
2r < 2r 
2m + 1 2m +1 





< Lm43,2m41(3, 3) = sin (0 <a < }). 
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If a = 3, the upper bounds in (29) are better than in (19). Moreover, 


vin (atte T 


Pea 6 5) = Tm+ite.tm+e(3, 3) < Ln+i4e,tm4e(a, a) 


2m+1+e 
(30) 2i-l+o-r 
= m+i+o,2m+ea 3,3 pt a ee See ters 
=> Lm4 2 +e(3 ;) = sin ite > 
(} Sa Ss 3;0 =0,1;¢ =1,2,---,m). 


We thus obtain the following asymptotic relations: 


Im-ti+e2m+e(a, a) = sin (Rott. *) -+ o( 4 


2m +1+ m? 
(31) (} Sa S3;0=0,1;1>0), 
2m+1+oa 8 4 en ee 
di - i +e . Im+ise,2mt+e(Q, a) 2 if — QO asm oi 
Making use of (17), we have 
2n+1l+o a os 
[ett [ieee + a, B)| - if = +0 asn-oow 
(3 =ss 3;¢ = 0,1), 
(32) 2 
n(d,6) = —14 2% ogy = 147 t Osesp, 
8n? 2n* 
suse, 9) = 1 4 tO) rasa) = 1 EO) (b <a <3) 
In view of (16, 17, 29, 32), we have 
— nf} oe l 
(33) "| [1 + 2i,n(a, B)| > 3 if i;sn— %, =e +0 (0 <a, 8 S 3). 


Although derived for a, 8 S 3, (33) remains valid, in view of (16), for all a, 8 > 0 
(as in the case of (28)). 


6. We apply the foregoing bounds for the zeros x;,, to the mechanical quad- 
ratures coefficients H;,,(a, 8) = H;., (see (4)) by means of the Tehebycheft 
inequality 


(34) O<Hina< [ . p(x) dx (ton = —1, Tn4in = 1;i = 1,2,-+-,n). 
It is known [9] that H;,.(a, 8) = Hy-is1.»(8, a). Also, by a method similar 
to that which leads to (17), we get 


Hin(3, B) = 2 Hay i2n(B, B), 
(35) 


H;,,.(3, 8) = Phy? sia tenga(B, B)H n+ ist,2n1(B, B) (2 = l, 2, ies n). 
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In what follows, we obtain an upper bound of the order nF for all H,;.,, such 
n 
that 1 < i S C (arbitrarily fixed constant) in which case Winston [9] gave a 


lower bound of the order = if a > 3. Hence, the true order, with respect to n, 
of Hy» 18 a = (@ > 2,1 <¢* SC). It is sufficient to illustrate the procedure in 
We have (by (34)), 

Hii. < (Tisr. = Ti-an)( + Ti-r.2)* (1 = Zistn) ; 


— 


the spec ial case} Sa, 8B < 


(Using the fact that p(x) is decreasing for —1 < x < - <5 and increasing 
Qa 


for et < x < 1 does not give any essential improvement.) By (25), 
a+fp-—2 


H..< gate -1 sin E — | 5] sin] 5 4] om 2i a’ | a 
so 2n+1 2] 2n+12]1 2+1 2 


242 pum ] a , 
| cos 3 3 = 0( —_—__— if 7 = O(n‘) (OS€e< 1). 


n2a—e(a—1) ’ 











Likewise, if 0 < a = 8B < }, then Hasiamse = o(+), fl sisC (oe =0, 1). 


Since Winston gave a lower bound for H;,, of the same order, this is its true order. 

H,,, and H,,,,, may be treated as follows. ((34) gives an upper bound in case 
a 21 but p(t) — ~ asx — —1 for0 <a< 1.) Fora > } and n sufficiently 
large, Winston showed that H;,, < He,,. He gave also an upper bound of the 


1 , : : : 
order —- for H,,, if a S 3, except when 8 < @ in which case H,,, < Hz, , so 
a ‘ 


] — : i , , 
that H,,, = o( 4). On the other hand, if ¢,(2) is the normalized Jacobi poly- 
n= 


nomial of degree n, 


a a 2° P@)P(a + IP(m + DP(n + B)_ -0() 
ee” Baw” ER Tint+at1)rin+a+8) 


ne 


=O 


[11], since K(x) = (= ei) < Oforx <2,,. Hence, the true order of H,,,, ts 


- foralla >0. Likewise, the true order of Hy... % 3 





The above upper bounds enable us, in case the behavior of f(z) near r = +1 
is properly specified, to estimate how much the mechanical quadratures formula 
(4) is affected if we omit even infinitely many of the terms H;.,.f(z;,,.) correspond- 
ing to the z;,, near +1. 

Illustration. Consider f(x) such that 
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| f(z) | = (asp) (0<1—{2| < 6; 4,1 positive constants). 


Then we can omit in (4) all terms H;,,,f(z;,,) corresponding to 
(36) i < Cin‘ ori > n — Czn‘ (C; > 0, C2 > O, € given constants, 0 < « < 1), 


provided’ 8 = a, 2a — «(2a — 1) > 2l. In fact, for ail such 7, | 2), | > 1 
(n — ~) (see (27)), so that, by (18), for n sufficiently large, 


' 2a 
0<1— |x| < 4, t — [Sal > ea 


Thus | De HinS (tin) |, extended over all i as given in (36), is = o( a ) 


n2a—e2a—1)—2 


—Oasn— ~, 


7. The method of the preceding section can be applied to Laguerre poly- 
nomials, in which case p(x) = 2*e* (a > 0) and the interval of orthogonality 
is (0, «), 

(i) 0<a@<1. Using (34) and the bounds for z;,,, given by Winston, we get 
(it+2 /aa—2)2 a as i 
Hence De fit BV = a\" 

oe 4(n + 1) ae} 

{6G +a + 1)*(n+1) ae } 
\@F Qe —2¥n+a) a $f" 


( 2 
2 = max a, (-+¥9)} (§ = 2,3, ---," — 1). 








(ii) a > 1. 


i+2+/a—2)? . i , 
Hin < ow wr [4G tat \*f, _ @+2Va— 2)*(n +a) 
én n+a \ 16 + a+ 1)*(n + 1) 


(¢ = 2,3, -+-, 2 —1). 





In particular, 


ae o(2) (1<i<C,a>0), 
; i C*a 

Hin = O(e"" n*) (o<cst1=S*,a>0), 
n 4a2 


(Since H,,.. < Hn-1.,, at least if n is sufficiently large [9], this remains valid for 


* This restriction is not essential, since we may always interchange a, 8. 
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i = n.) Winston gave a lower bound of the order 2 for H;., (a > 4;% = 
ne . 
1, 2, --- ,n) and showed that’ 
1 . 
Hi. = o() (0<aZ}4); Ain»< He, (a> 4,n sufficiently large). 


Here, [11], 





= Forint» = o£) 


Hence, the true order of H,,, is -; (a > 0) and the true order of H;,,, is i (a > }, 
a n= 


1<:<C). We note, however, that H;,, is not of the same order with respect 
to n for all 7. 

In accordance with the results of Winston, similar results could be obtained 
for Hermite polynomials. 


8. The above classical orthogonal polynomials of Jacobi (J), Laguerre (L), 
and Hermite (H) possess the remarkable property, important for the preceding 
discussion, that the derived polynomials are again orthogonal polynomials 
J, L, and H respectively, but with new parameters. For Hermite polynomials, 
we have a still more remarkable result: $n (2) = non-1(2), i.e., the weight func- 
tion remains unchanged, or, in other words, Hermite polynomials form a system 
of Appell [13] polynomials. We close by giving a simple proof that the Hermite 
polynomials are the only orthogonal polynomials with this property.” 

TueoreM 4. Jf {¢,(xz) = x” + --- },n = 0,1, --- , is an orthogonal system 


of polynomials, i.c., Gm(x)bn(x)dy(x) = 0 (m # n, ¥(x) monotone non-de- 


creasing), such that @,(x) = n@n-s(x) then {@,(x)} is reducible, except for constant 
factors, to the Hermite system {H,(x)} of polynomials by means of a linear trans- 
formation on x. 

Proof” Write with Appell [13] 


n(n—1)  a-2 
= 


n 


mt oe 


(37) (x) = 2" + cana aa G,i1t + 4, 


where a, @2,°**,@,,°** are given constants uniquely determined when 


7 Winston’s result (corrected for a slight misprint) is 
T?(a@)I'(n) 1 . 
i; 4" = — O0<aS};i =1,2,---,n). 
“= Nate) wp 6 KeSEE 
8 The same result has recently been derived by Meixner [12] as a consequence of more 


general considerations. 
® The present proof was obtained before the proof recently published by Shohat [15]. 
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\¢n(z)} is given. It follows (by induction for n > 1) from the fundamental 
recurrence relation [11] 
dnsi(x) = (x — Crs rn (2) — Xs +19n—1(2) (n = 1,2,--- ) 

that 

o,(—2 — ay) = (—1)"¢,(x = a) (n = 0, 1, eee ). 
Hence, ¢:(2 — a) contains only even or only odd powers of x, so that 

dbomo(X per a) sa :® see y™ i)+e 

i=0 

(38) a. (¢ = 0, 1; C;,,, constants) 

d2m+0(X) = +B Ciel + a)" ” aa 


i=0 


Comparing (37, 38), we obtain the important relation 


dense(X) = > C; Pa °) (x as ay it+e 


(39) i=0 2 
(o = 0,1; m = 0,1, --- ; C; constants), 
where the C; are functions of the a; (¢ = 1, 2, --- ) and do not depend explicitly 


on m (Co = 1). We note the additional property of (39) that only even or only 
odd powers of x + a occur. Moreover, by the orthogonality property, 


/ (7 + ay) o,(x)dy(x) = | x’ d,(x — a)dp(x — a,) = 0 


(40) - 
Letting 
8; = / (a + a)'dy(x) = / rdy(x—a), Bo>O (2 = 0,1,---), 


we have [11] 


Bo Bi - B, 
Bi Be -** Buss ie + Br | 
Hema) ae II 
a ee Br Bon» 
Ll weer x 
We see that the sequence ¢,(¢% — a), n = 0, 1, --- , is a sequence of symmetric 
orthogonal polynomials whose moments are 8), 8:,---. If we take n = 1, 2, 


- successively in (40), it follows that 


(41) Beisr = 0 (i = 0,1,---) 
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Substituting (39) in the first integral (40), we obtain now (for « = 0, 1) the 2m 
equations 


2 2 
Cm Bo + ( “9 m—1B2 + -** + Bo o »)¢ 1Bem—2 + Bom = 


(42) (2m + 1)CmBe + P*; 7 ') CmaBi + ++ + ae : ') C1Bem + Bans = 0, 


“a(n ') CiBim-2 + Bim = 0. 
In particular, 
C1 Bo + Be = 
By means of (42), we may eliminate C,, C2, -+-- ,C and express Ba; (i = 
2, 3, --- ) uniquely in terms of 8), Be. Evidently, we may assume p fixed 


(say = 1). Hence, 6, C; (¢ = 1, 2, --- ) are determined as soon as f: (> 0) 
is given. 

It is clearly sufficient to show that all systems of polynomials given in the 
form (39), which satisfy (40), are reducible to one system. Let {¢,(r)} be 
such a system where a; , 82 have certain given values d; , B: respectively. By the 
preceding, §2;, C; for the system {¢,(x)} will have certain values B;, C; (i = 
1, 2, --- ). -Consider now another such system {¢,(x)} determined by different 
a;, Be. Set Bo = cB (c = certain positive constant). Then, by induction 
from (42), we get for the 2; , C; corresponding to the system {¢,(r)}, the im- 
portant relations 


Bos = ¢° Bas, Ci = eC, (i = 1,2,---). 
It follows by (39) that 


domye(t + dh — Mh) = ) C; ry 4 ’) (x + & — a, + a)” ite 


i=0 
we > cd, > ’) (x + 4, y2m—i) +6 
i=0 


entre > C, - a y : ae 


or, again, if we apply (39) to dens0(x), 
dom4o(Cx + cd, — a) = 


entre 


dem+o(Z). 








9 


1. 


15. 
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Thus all systems of orthogonal, Appell polynomials are reducible, by the linear 
transformation x/cex + cd; — a, to one system, which is necessarily the system 
of Hermite polynomials 


. —1i1) «2 —] —2 —=— 2) 
H (x) =it- a ar a n(n ee ) x i ee F 


since the latter is a system of Appell polynomials. 


BIBLIOGRAPHY 


Tu. Srieirses, Sur les polynomes de Jacobi, Comptes rendus, vol. 100 (1885), pp. 620- 
622. 


. K. Surpata, On the distribution of the roots of a polynomial satisfying a cerlain differential 


equation of the second order, Jap. J. Math., vol. 1 (1924), pp. 147-153. 


. W. Lawton, On the zeros of certain polynomials related to Jacobi and Laguerre poly- 


nomials, Bull. Amer. Math. Soc., vol. 38 (1932), pp. 442-448. 


. D. Hitsert, Ueber die Discriminante der im endlichen abbrechenden hypergeometrischen 


Reihen, J. fir Math., vol. 103 (1888), pp. 337-345. 


5. F. Kiem, Ueber die Nullstellen der hypergeometrischen Reihe, Math. Annalen, vol. 37 


(1890), pp. 573-590. 


3. E. Van Vuieck, A delermination of the number of real and imaginary roots of the hyper- 


geometric series, Trans. Amer. Math. Soc., vol. 3 (1902), pp. 110-131. 


. A. Hurwitz, Uber die Nullstellen der hypergeometrischen Funktion, Math. Annalen, 


vol. 64 (1907), pp. 517-560. 


. E. R. Neumann, Beitrége zur Kenninis der Laguerreschen Polynome, Jahresber. der 


Deut. Math. Ver., vol. 30 (1921), pp. 15-35. 


. C. Winston, On mechanical quadratures formulae involving the classical orthogonal 


polynomials, Annals of Math., vol. 35 (1934), pp. 658-677. 


. W. Haun, Die Nullstellen der Laguerreschen und Hermiteschen Polynome, Schriften des 


Math. Sem. und des Inst. fiir angewandte Math. der Univ. Berlin, vol. 1 (1933), 
pp. 213-244. 


. J. SHowar, Théorie générale des polynomes orthogonauz de Tchebichef, Mémorial des Se. 


Math., vol. 66, Paris, 1934. 


. J. Merxner, Orthogonale Polynomsysteme mit einer besonderen Gestalt der erzeugenden 


Funktion, Jour. London Math. Soc., vol. 9 (1934), pp. 6-18. 


3. P. APPELL, Sur une classe de polynomes, Ann. Se. Ee. Norm. Sup., vol. 9 (1880), pp. 119- 


144. 


. C. BuELL, The zeros of Jacobi and related polynomials, this Journal, vol. 2 (1936), pp. 


304-316. 
J. SuHonat, The relation of the classical orthogonal polynomials to the polynomials of 
Appell, Am. Jour. Math., vol. 58 (1936), pp. 453-464. 


THe UNIVERSITY OF PENNSYLVANIA. 











RINGS OF SETS 
By Garretr BirKHOFF 


1. Definitions. Following Hausdorff,’ a family § of subsets of a class J is 
said to form a “ring” if and only if it contains, with any two sets’ S and 7, 
their swm (or union) S U T and their product (or intersection) SN T. Clearly 
a ring contains, with any finite number of subsets S,,---,S,, their sum 
S, U--- U S, and their product 8; N--- A S,. 

The family § is said to constitute a “complete ring” if and only if it contains, 
with any subfamily S of sets S,, their sum V S, and their product A S,. 


acS aeSs 
The family § is also said to be a “‘o-ring”’ if and only if it contains, with any 
countable subfamily S of sets S, , their sum V S, and their product A S, . 


aeS aeS 
It is obvious that rings containing only a finite number of sets, and o-rings 
containing only a countable number of sets, are necessarily complete rings. 
These theorems can be improved by using chain conditions; however, the 
family € of all finite sets of integers is a countable ring which is not a o-ring 
(and a fortiori not complete), while the family © of all countable subsets of the 
continuum is a ¢-ring which is not complete. 


2. The importance of the subject. Rings of sets are mathematically impor- 
tant for a number of reasons. They are conceptually important because one 
can define them so simply in terms of two fundamental operations. They are 
also important because the sets of any class J carried within themselves by any 
one-valued transformation of J into itself are a complete ring. (The proof of 
this will be left to the reader.) Also, as is well known, the open and closed sub- 
sets of any topological space constitute rings, and the measurable subsets of 
any Cartesian n-space constitute a o-ring. 

Again, the reader will immediately see that 

(2a) The sets common to all the rings (resp. o-rings or complete rings) of any 
aggregate of rings of subsets of any class J themselves form a ring (resp. o-ring 
or complete ring). 

It follows that the closed subsets of any topological space = invariant under 
any group of transformations constitute a ring. The study of these rings is 
important in dynamics,’ where, however, the existence of minimal closed and 
connected constituents introduces special considerations. It follows also that 


Received January 16, 1937. 
? Mengenlehre, 1927 (2d ed.), p. 77. 
? We shall systematically use small Latin letters to denote elements, Latin capitals to 
denote sets of elements, and German capitals to denote families of sets. 
? Especially in the theory of so-called ‘‘central motions”. Cf. G. D. Birkhoff, Dynamical 
Systems, 1927, Chap. VII, §6 ff. 
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the subsets of any class J carried within themselves under any aggregate A of 
one-valued transformations 7, of J into itself form a complete ring. . 
Moreover, all complete rings of sets belong to at least one aggregate 1 of 
transformations in this way. More precisely, any complete ring 8 of sets 
belongs to the “groupoid’™ of all one-valued transformations carrying every 
S «Minto itself. This shows that rings of sets play the same réle in the theory 
of groupoids of one-valued transformations as is played by transitivity and in- 
transitivity in the theory of groups of permutations (one-one transformations).” 


3. Equivalent notions. If we add the empty set O and the all-set / to any 
complete ring of subsets of a class J, we still have a complete ring. Hence the 
theory of rings of sets is contained in that of rings containing O and J. 

THeoreM 1. The complete rings of subsets of I which contain O and I can be 
identified with the different quasi-orderings of I or with the different completely dis- 
tributive topologies on I. 


Explanation 1. By a “quasi-ordering”’ of J is meant a binary relation r = y 
satisfying 
Pi: x = x (reflexiveness), 
P2: x 2yandy =z imply x 2 z (transitivity). 
- 


By a “completely distributive topology” is meant a unary operation S —+ 8 
(called closure) on the subsets of J which satisfies 
C1: § 2S, C2: 0=0, c3: § = 8; 
C4: if S = VS,, then S = VS,. 

a a 
(These are related to well-known axioms of Hausdorff on “partial ordering’, 
and of Riesz-Kuratowski on closure.) 

Explanation 2. By an “identification”? we mean a one-one correspondence 
preserved under all permutations of the elements of J. It follows that if we 
call two families of sets of J resp. two relations on J resp. two operations in J 
“equivalent” if and only if there exists a permutation of the clements of J 
‘arrying one into the other, then the numbers of non-equivalent rings of sets, 
of non-equivalent quasi-orderings, and of non-equivalent completely distributive 


4 A family G of one-valued transformations of J into itself is termed a ‘“‘groupoid”’ if and 
only if it contains the identity «: x — z and the product or: x > z[o(zx)] of any two of its 
members o and vr. The author is preparing an article on groupoids in collaboration with 
S. Ulam. 

5 The sets invariant (i.e., the sets identical with, and not merely supersets of, their 
transforms) under any permutation or set of permutations constitute a ‘‘complete field” 

i.e., a complete ring which contains, with any set, its complement. Moreover, any 
complete field belongs to the group of all transformations leaving its subsets invariant 
(‘‘intransitive’’ on its subsets)—this leads to the usual partial descriptions of groups of 
permutations through their ‘‘transitive systems’’. 

Actually, in the case of groups of permutations of J, any subset carried within itself 
under all their permutations is necessarily invariant. 
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topologies on J are the same——as well as the numbers of distinct complete rings 
of sets, of distinct quasi-orderings, and of distinct completely distributive 
topologies. 

Proof of theorem. Let ® be any complete ring of sets containing O and I. 
Make the definitions: (1) « = y (®) means that every Se® containing z 
contains y, and (2) § is the product of all sets 7. ¢ R containing S. That the 
relation and operation so introduced satisfy P1-P2 and C1-—C2 is obvious; it is 
also obvious that the correspondence between them and 9% is preserved under 
all permutations of the elements of J. 

To prove C3-C4, recall that, since ® is a complete ring, S is the least set in R 
containing S. This proves C3 and 

(3a) S eM if and only if S = 8S. 

Now suppose S = VS,. Clearly S 2 S, irrespective of a; hence S =VS.. 


But conversely VS, €R, since R is a complete ring, and VS, = VS, = S; 
hence VS, = 8. This proves C4. 


It remains to prove that every quasi-ordering and every completely distributive 
topology belong to such an §®, and that distinct R determine distinct quasi- 
orderings and distinct topologies-—four assertions in all. 

By (3a), if NR ¥ MN’, then certainly R and MR’ yield distinct topologies. This 
proves one assertion. We next wish to prove that 

(38) Every completely distributive topology is determined by a suitable 9. 

Under any such topology, consider the family § of “closed” sets S = 8. 
Clearly § contains O, J, and (by C4) VS, if it contains every S,. But it also 
contains AS, under the same hypotheses. 


a 


Proof. If 8. = Sa for every a, then AS. = ASa , and so (ASa ) s S, for 
all S, , whence (AS) S AS, = AS« ’ ond so by Cl ASa is dunk Hence § 


is a complete ring of sets w vith O nd I. Moreover, if 8 is any set, then S is the 
product of the T,. « § containing S—by Cl, AT = AT. > §, and, by C1-C3, § 


is a closed set T, 2 S. Thus & “determine s” the given topology. This 
proves (38). 

Again, if 3 is given, then 

(3y) SeM if and only if ze S and x 2 y (R) imply ye S. 

Proof. If Se®, by definition the second statement holds. Conversely if 
the second statement holds, then S contains, with every z, the set S(z) of all 
y <= x (M)—ie., the product of the S, «R with z eS, ; obviously S(z) « R— 
and so S is the sum VS(z) of the S(x) of the ze S, and is in R. By (3y), if 
R = HR’, then R and R’ determine different quasi-orderings. 

Finally, every quasi-ordering p is determined by some ®t. For, given p, let 
R(p) consist of all S such that z¢«S and xz 2 y imply ye S. Clearly O € R(p) 
and J e¢R(p). Also, if a family S of S, is in R(p), then (VS,) € R(p) and 











446 GARRETT BIRKHOFF 


(AS.) eR(p). Thus R(p) is a complete ring of sets containing O and TJ. 
Further, if 2 y, then obviously z = y (M(p)) in the sense that x « S € R(p) 
and x = y imply ye S. Conversely if r = y (M(p)), then the set S(x) of all 
z < x (which is in R(p) by P2 and contains x by P1) contains y—by definition 
of x = y (MR(p))—-and so x = y. This proves the fourth assertion. 

4. The case of fields of sets. Which quasi-orderings and which completely 
distributive topologies correspond to complete fields® of sets? And what does 
this make Theorem 1 reduce to for fields of sets? 

THeoreM 2. In Theorem 1, a quasi-ordering corresponds to a (complete) field 
of sets if and only if it is an equivalence relation; a topology does, if and only if 
the closures of its points are the subsets of a partition of I. 

Explanation. By an “equivalence relation” is meant a quasi-ordering which 
satisfies 


P3’: x = y implies y 2 z. 


By a “partition” of a class J is meant a division of its elements into disjoint 
subsets, whose sum is J. 

Proof. Let ® be a complete ring of sets, and let S(z) be the product of the 
sets S,eR containing x. Then x 2 y (MR) means ye S(x). If ® is a field, 
and y e S(x), then the complement S’(y) of S(y) cannot contain x —otherwise 
reS'(y) N S(x) < S(x) — y < S(x)—and so y € S(x) implies x « S(y). This 
proves P3’. Again, topologically, S(x) is the closure of z. Hence if 8 is a 
field, unless S(x) and S(y) are disjoint, S(z) M S(y) contains some point z, 
and x = zand y = z, whence by P2 and P3’ x 2 y and y 2 g, and therefore 
S(xz) = S(y). 

Conversely, if P3’ holds, and & is the family of sets S such that x «8 and 
y S x imply y«S, then S eR implies that x not in S and y S z imply y not 
in S (otherwise y e S and x < y by P3’), whence S’ eR and & is-a field. The 
fact that the sums of the parts of any partition of J are a complete field of 
sets is obvious. 

Corotuary. The complete fields of subsets of I which contain O and I can be 
identified with the different equivalences on I or with the different partitions of I. 


5. Rings of sets and distributive lattices. We shall deal below with rings of 
sets without assuming completeness. 

Suppose we consider rings of sets simply as collections of symbols (forgetting 
that the symbols denote sets of points) related by inclusion, addition and 
multiplication. Then any ring of sets appears as a “distributive lattice’, or 
system § of elements S, 7, U satisfying® 


* A (complete) ring of sets is called a (complete) field if and only if it contains the com- 
plement of every one of its members. 

7 Part of this result is proved by H. Hasse, Héhere Algebra, vol. I, 1933, p. 15, and B. 1. 
van der Waerden, Moderne Algebra, p. 14. 

* Cf. the author’s On the structure of abstract algebras, Proce. Camb. Phil. Soc., vol. 31 
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Li: SNAS=S and SUS=S. 

L2:SNT=TNS and SUT=TUS. 

I:(SATNU=SN(TNU) and (SUT)UU=SU(TUD). 

14:SN(SUT)=SU(SNT)=S. 

16: SU(TNUV)=(SUT)N(SUL) and 
SN(TULV)=(SNT/U(SNL),. 


Moreover, two rings of sets seem indistinguishable when and only when they 
are “isomorphic’’—-i.e., admit a one-one correspondence preserving inclusion, 
sums and products.” 
Conversely, every abstractly given distributive lattice is known to be obtain- 
. 10 
able from at least one ring of sets. 


6. Representation theory for distributive lattices. It is generally true in 
representation theories for abstract algebras that one gets the simplest results 
by considering homomorphic (many-one) as well as isomorphic (one-one or 
“true’’) representations. 

A full representation theory for Boolean algebras by fields of sets has been 
developed by Stone,” and it is interesting to see the complications which arise 
in the more general case of distributive lattices. These show that the assump- 
tion that complements exist cannot be eliminated in Stone’s theory. 

First, let R be any distributive lattice, and let @ be any congruence relation” 
on %. Then the elements congruent to O form an “ideal”? D in the sense that 


Il: X «QO and A eR imply AN X OD. 
12: X eOand YeODimply X U Yeo. 


In case 3 is a Boolean algebra, © determines @, but this is not generally true 
in distributive lattices. 

Proof. With Boolean algebras, S and 7 are congruent mod @ if and only if 
(SM 7’) U (S’ 1 T) €M, whereas O is an ideal in the chain of three elements 
I > X > O, determined by two distinct congruence relations. 


(1935), pp. 433-454. O. Ore calls distributive lattices “arithmetic structures’’. Con- 
siderable work has been done by Fritz Klein on the decomposition of distributive lattices 
important in number theory; M. Ward has also given categorical definitions of such systems. 

* Actually, any one-one correspondence preserving any one of these three preserves all; 
this is not true of many-one correspondences. 

1° Cf. the author’s On the combination of subalgebras, Proc. Camb. Phil. Soc., vol. 29 
(1933), pp. 441-464, Theorem 25.2. 

1M. H. Stone, The theory of representations of Boolean algebras, Trans. Amer. Math. 
Soc., vol. 40 (1936), pp. 37-111. By a “representation’’ of a distributive lattice L, we 
mean a homomorphism between ZL and a ring of sets. 

2 L.e., any partition of the elements of R determining an abstract homomorphism. This 
is a basic notion of general abstract algebra, whose detailed definition we shall omit. 
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Again, let % be any distributive lattice, and A any element of R. The relation 
X = Y mod A meaning X U A = Y U A isa congruence relation. 
Proof. That it is an equivalence relation is obvious. Moreover, by L1-L6, 


(YUY)NA=(NU A) U(Y U 4), 
(NN Y)UA=(NU A)N(Y UA); 


i 


hence the correspondence X —» X U A defines a homomorphism of § onto a 
subring of itself. 

If ® is a finite Boolean algebra, there are no other congruence relations on MR; 
this is not true for finite distributive lattices which are not Boolean algebras 
(proof omitted). 


7. Prime ideals. Let us now suppose that & is any distributive lattice, and 
let us attempt to give a full representation theory for R. 

Let 0: R — MN be any homomorphism from F# to a ring of subsets of a class J. 
We may classify the points of J into three categories: those contained in every 
set X € 9, those contained in no set X ¢ 9, and the others. The first two 
categories of points are trivial, and so we can assume that Oe R and J ¢€ R. 

Under these circumstances, every p «J divides the elements of R into two 
categories: those corresponding to sets including p, and those corresponding 
to sets excluding p. The second set of elements is an “‘ideal’’, while the first 
is a “dual ideal’ D in the sense that 


Di: reDandaeR imply a U ve D. 
D2: reDand ye Dimply «NM ye D. 


Hence the representation of R through ® is characterized to within equivalence” 
by which divisions of R into an ideal and complementary dual ideal occur, and 
how many times cach occurs. 

But conversely, by [1-12 and D1-D2, if one is given any correspondence 
associating each division x of R into an ideal J and complementary dual ideal D 
with a cardinal number n(), then this belongs to a representation of R by a 
ring of sets, and so if we define (with Stone, op. cit.) an ideal to be “prime” if 
and only if its complement is a dual ideal, we have 

TuHeoreM 3. The inequivalent representations of a given distributive lattice R 
as a ring of sets are the different functions whose arguments are the “prime ideals’’ 
of R, and whose values are cardinal numbers. 

Remark 1. With Boolean algebras, the number of elements in any prime 
ideal and its dual are the same. Also, no prime ideal contains any other prime 
ideal. Neither of these properties is true in distributive lattices not Boolean 
algebras (e.g., the chain J > X > O). 


8 Te., to within differences between the various points pe/. This is standard termi- 
nology. 
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Remark 2. It is natural to call a representation “irredundant”’ if and only 
if no prime ideal appears as a point more than once. 


8. The finite case. Only exceptionally are the prime ideals of infinite Boolean 
algebras known. But in each finite Boolean algebra of order 2” they are known 
to be n sublattices of order 2”™. 

We shall go further and determine the prime ideals of all finite distributive 
lattices. 

Accordingly, let R be any finite distributive lattice, P any prime ideal in R, 
and D = R — P the dual of P. Form any connected chain* 0 < 4 < 
te << +--+ <a, = I in R; it is clear that in such a chain there will be exactly 
one “link” a; < x;.; such that a; ¢ P and x;., e D, and that x, ¢€ P for k < 7, 
while x, € D for k > 7. 

(8a) We have yeP or yeD according : 
(x; U y) N ays is a; or ayer. 

Proof. Since x; S ji, Xi U YA win) = (a U y) NV xis (by L6 and 
contraction). Again, for any y, obviously 2; S v S 2i41 ; hence either v = 2; 
or v = 2\4; (no further interpolation being possible). But if [r; U (y A 2;.1)] = 
x; ¢ P, then by It (y A 24.1) € P, and so by D2 (since 244; € D), ye P. Similarly, 
if a; U (y A xis) = ais. € D, then by 12 (since 2; € P), y N xi, € D, and so 
by Dl, y e€ D. 

Definition. By a “prime factor” of a distributive lattice is meant any symbol 
r/y, where y < x and no element can be interpolated between y and x. A 
prime factor x/y will be called a “cleavage” for a given prime ideal P if and 
only if ye P and xe(R — P). 

(88) Any prime factor a/b is a cleavage for some prime ideal. 

Proof. Let zx ¢ P if and only if (b U x) N a = b; this makes x « (R — P) if and 
only if (6 U 2) N a = a, since for allz,b < (6 Ux) Na=bU (x Na) Sa, 
and a/bis prime. Clearly a e (R — P) and b« P (by L4). It remains to prove 
I1-I2 and D1-D2. But Il and D1 are obvious, since (b U x) A ais decreased 
resp. increased by substituting ¢ NM y resp. x U y for x. Moreover, under the 
hypotheses of 12, 


b=(bU@Na| UlbUYWNaj=bU[@Na)U Wy Na)| 
=bUl[rUyN@WUaANYWUaNal=bU [ir Uy Nal. 


This proves I2. The proof of D2 is dual. 

TuHeoreM 4. Let R be any finite distributive lattice, and let its prime ideals be 
P,,-+-+-,P,. Then in every connected chain O < 4 < +++ < x, = TI, each 
Ti41/7; ts a cleavage for just one P;— whence r = n. 

Proof. By (8a), if P; # P;, they can have no cleavage in common, and 
by (88), every prime factor is a cleavage for some P; . 

We have the Jordan-Dedekind theorem” on the constancy of the number of 


Tis1) = 


Il 
. 
a | 


14 A chain is called “connected” (or dense by Ore) if no further terms can be interpolated 
in it. 
1% R. Dedekind, Werke, vol. II, p. 254. 
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links in connected chains as one corollary, and using Theorem 3, we have the 
further 

Coro.iary. A finite distributive lattice has (to within equivalence) exactly one 
irredundant isomorphic representation as a ring of sets——and the number of points 
involved is the number of links in its connected chains.” 


. The finite case (continued). Let & denote again any finite distributive 
lattice, let its prime ideals be P, , --- , P, , and let their duals be D, , --- , D,. 

Let further s; = s(P;) and p; = p(D,) be the sum of the ze P; resp. the 
product of the re D;. By 12, s;¢ P;, and by D2, p; « D;; hence (ef. I1-D1) 
xreP; means x S s; and xe D; means x 2 p;. 

Now let J denote the partially ordered set’ of the s;. Call a subset S of J 
“closed” if and only if s; « S and s; S s; imply s; € S. 


9a) R is isomorphic with the ring of ‘“‘closed”’ subsets of J under the corre- 
I £ 


spondence S = A s;. 
ayes 
Proof. Let S be a “closed” subset of J. Then by I1-12 and D1I-D2, A s; 
sjeS 
is in the P; corresponding to these s;, and no others. But given ze R, the 
subset of s; 2 x is closed, y = A s; is in the same P; as x, and hence y U z 
sj2z 
and y fl x are, and so by (88) no prime factor can be inserted between them, 
and «x = y = A s;. Thus the correspondence z= As is one-one. But it 
si2z si2z 


clearly preserves inclusion, while by Theorem 1 the closed subsets of I are a 
ring of sets. This completes the proof. 

Consequently two finite distributive lattices having isomorphic partially 
ordered sets of s; are isomorphic. But the converse is obvious, since the s,; 
are intrinsically defined. Since, finally, if X is any (abstractly given) partially 
ordered set, the ring of its closed subsets is a distributive lattice having the 
“closures” of points of X for s, , we obtain 

TureoreM 5." There is a one-one correspondence between the partially ordered 
sets of n elements and the distributive lattices whose connected chains are of length n. 

In the notation of a previous article (this volume of this Journal, p. 311), by 
(9a) this is the correspondence X = B*. 

Remark 1. The connected components X,,--- , X, of X correspond to the 
indecomposable direct factors of 


B* = (B") x --- & (B*) 


in the direct decompositions of B*. 


% Cf. On the combination of subalgebras, Theorem 17.2. 

‘7 A set is ‘“‘partially ordered’’ (the terminology is Hausdorff’s, Grundziige der Mengen- 
lehre, 1914, Chap. VI) by a quasi-ordering satisfying P3: 2 = y and y = z imply z = y. 
Any subset of a partially ordered set (such as a distributive lattice) is partially ordered 
by the same relation. 

'8 Theorem 5 was announced without proof by the author in a note Sur les espaces dis- 
crets, Comptes Rendus, vol. 201 (1935), p. 19. 
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Remark 2. The “Hasse” diagram’’ for X gives an infinitely more compact 
and intelligible way of writing down a general distributive lattice B* than the 
multiplication table used by Dedekind, or than the “Hasse diagram” for B* 
itself used by recent authors. 

Remark 3. Let ® be any finite ring of sets, and L = B* the distributive 
lattice isomorphic with R. To find X, take the quasi-ordering determined by ®; 
identify points x and y satisfying both x 2 y (M) and y 2 zx (MN); the partial 
ordering induced by the quasi-ordering on the sets of “identified’’ points will 
yield X. 


10. The indecomposable elements. Let us again suppose that # is a finite 
distributive lattice with prime ideals P; , --- , P, having duals D,, --- , D,, . 

(10a) Each s(P;) = 8; is product-indecomposable. Dually, each p(D;) = p; 
is sum-indecomposable. 

Explanation. An element a of a lattice 2 is called “‘product-indecomposable” 
when no two elements x > a and y > a exist with z N y = a; it is called sum- 
indecomposable when (dually) no two elements x < a and y < a exist with 
t U y = a. 

Proof. Vix > s; and y > s;, then xe D; and y e D;, whence (x N y) € D, 
ands Ny # 8;. 

(108) If x € R is product-indecomposable, it is an s;. If it is sum-indecom- 
posable, it is a p;, dually. 

Proof. If x is product-indecomposable, then it yields a unique prime factor 
a/x: let P; be the corresponding prime ideal. Clearly x = s(P,), since if 
x < s(P;), we would have y = s(P,), whence y « P;. 

Coro.tiary. The number of sum-indecomposable resp. product-indecomposable 
clements of a finite distributive lattice is the length of its connected chains. 


10a. The free distributive lattices. Consider the “free’’ distributive lattice 
generated by n symbols 2, ,--- ,2, ; Theorem 1 inclines one to adjoin to it 
elements O and J such that O < a2; < I for all 2;. 

If this is done, then the product-indecomposable elements form a Boolean 
algebra B" of 2" elements. More precisely, they are the elements, O, 2; , 
az, Ue2e;,7, U2z;Uni,mn U2, Uz, Un,---,m U-s- Um, U 
te, U--- Uax,.x, U--- Ux,. The corresponding prime factors are 


ry N--- NN 2x,/0, 
I U (a fi ees NM x, Aas coe 1M xn)/2e , 


(7, U--- Vay Uay, U--- Ux, Ux, 
/(x1 U---U Tk~1 U T+ U.:-- U z,), 
I/(x, U sae U r,). 


"Cf. H. Hasse, Hohere Algebra, vol. U1, 1927, p. 103, p. 123. 
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It is a corollary that the free distributive lattices with O and J adjoined are 
the B”". 
The proofs of the above statements are tedious; they depend on the knowledge 


” 


of canonical expressions for the elements of the free distributive lattice.~ 


11. A general decomposition theorem. Let FR be any (finite or infinite) 
distributive lattice, and let 


es were Pee a eee 


be any two representations of an element 2 ¢ Ras a product. Then irrespective 
of 7,2; = 7; Ur = 2; U (Ay) = Ae, Uy). Hence if x; is product-indecom- 
i i 


posable, some x; U y; = 2;. This means some y; S$ x;. Symmetrically, some 
x, S y;. Hence either x, = y; = x;, or 2; is redundant in the strong sense 
that some x, < x;, whence 


r=aN---NaeranNaan---Ne,. 


Thus if the decompositions are irredundant, the 2; and y; are equal in pairs, 
r = s, and so 

(lla) Ina distributive lattice, no clement has more than one irredundant product- 
decomposition (sum-decomposition) into elements not themselves further decom- 
posable. 

But conversely, any modular lattice which is not distributive is known” to 
contain a sublattice of five elements a, b, x1, 22, 23 satisfying a < x; < b, 
a; N x; = a, and x; U x; = b [i ¥ jj. Now starting with the two product- 
decompositions a = 2; M x2 and a = a2 MN 2; of a, making further decompositions 
as long as possible, and eliminating redundant components, we see that any 
factor for the second decomposition which contains 2; must contain x2 or 2; 
and hence b—-whereas the first decomposition and those derived from it must 
possess at least one factor containing x; but not b. Hence if the above process 
is terminating, we will surely get two distinct product-decompositions of a. 
But in the presence of the chain-condition, the process 7s terminating. This 
completes the proof of 

TueorEM 6.” A modular lattice satisfying the ascending chain condition is 
distributive if and only if cach of its elements has a unique irredundant product- 
decomposition. 


20 The latter are given by Th. Skolem, Uber gewisse ‘“Verbdnde” oder “Lattices’’, Avh. 
Norske Videnskaps Akademi i Oslo, Mat.-Naturv. Klasse, 1936, no. 7, pp. 7, 8. rom 
them it is immediately obvious that the elements specified above are the only product- 
indecomposable elements—but there are just enough such elements to give the lattice 2" 
dimensions; hence they are all product-indecomposable. 

2 Cf. Theorem 4 of the author's On the lattice theory of ideals, Bull. Amer. Math. Soc., 
vol. 40 (1934), p. 617. 

*2 This result was announced by the author in Abstract 41-1-75 of the Bull. Amer. Math 
Soc., vol. 41 (1935), p. 32. 
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This result is especially interesting in the light of recent proofs by Kurosch 
and Ore that, in any modular lattice, the number of factors in any two irre- 
dundant product-decompositions of the same element into indecomposable 
factors is the same.” 


12. Some enumeration problems. (ne very impartial test of one’s ability to 
classify finite systems is one’s ability to enumerate them. This suggests the 
problem of determining the following combinatorial functions. 

(12.1) The number F;(n) of different rings of subsets of n elements. (This 
is the number of sublattices of the Boolean algebra B" of 2" elements.) 

(12.2) The number F2(n) of non-equivalent rings of such subsets. (This is 
the number of such sublattices non-conjugate under the group of auto- 
morphisms of B".) 

(12.3) The number F3(n) of non-isomorphic rings of such subsets. (This is 
the number of non-isomorphic distributive lattices of “dimensions” n.) 

(12.4) The number F,(n) of non-isomorphic partial orderings of n elements. 

Remark 1. If we replace “ring” by “field” in the above, F;(n) becomes a 
known combinatorial function defined by the recurrence 


H*¥(n+1)=> (;) H* (n — h). 


A=0 
This has been studied by Aitken (Edin. Math. Notes, vol. 28 (1933), pp. xviii 
xxili). Again, F.(n) becomes the partition function—a celebrated asymptotic 
formula for which has been given by Hardy and Ramanujan. And lastly, 
F;(n) becomes n. 


Remark 2. In virtue of Theorems 3 and 5, F3;(n) = > Fi(n). Also, 
° k=1 


F,(n) is by Theorem 1 the number of non-equivalent quasi-orderings of n ele- 
ments, and F,(n) is the number of different quasi-orderings of n elements. 
A table for these functions for small n follows. 


1 2 3 4 5 6 
F\(n) 1 3 29 
F.(n) l 3 9 30 
Fin ) l 2 5 15 51 250 


* A. Kurosch, Durchschnitisdarstellungen mil irreduziblen Komponenten in Ringen und 
in sog. Dualgruppen, Rec. Math. (Moscow), vol. 42 (1935), pp. 613-16. O. Ore, On the 
foundations of abstract algebra. II, Annals of Math., vol. 37 (1936), p. 270, Theorem 11. 
The result of Kurosch-Ore contains a decomposition theorem of E. Noether for ideals, 
and a less well-known result of Remak’s on finite groups, as special coroilaries. 
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In calculating these values, assume that /'2(n) is the number of functions from 
the different partially ordered sets of k < n elements to cardinal numbers whose 
sum isn. Also F;(n) can be calculated combinatorially from F2(n) by summing 
the occurrences of each type of ring of subsets, over the types existing. To 
find Fy(n), separate each partially ordered set into its connected components. 

It would be very interesting to know more about the F;(n), numerically or 
asymptotically. y(n) resembles the function describing the number of groups 
of order 2"-—-whose first values are 1, 2, 5, 14, 51, 266,---. It appears to 
increase more rapidly than the function describing the number of non-isomorphic 
symmetric relations between n objects (or alternatively, the number of non- 
homomorphiec graphs with n vertices), whose first values are 1, 2, 4, 11, 27. 
But as almost nothing is known about the rate of growth of these functions, 
these comparisons are not very reliable. 


13. Homomorphic images and sublattices. [Let us try to determine the 
homomorphisms and sublattices of a given finite distributive lattice, guided by 
the previous results. 

Some authors,” inspired by the numerous analogies between lattices and rings, 
have correlated the congruence relations on lattices with “ideals”? and “normal 
sublattices”. But except in the “complemented” case in which cach element x 
possesses a complement 2’ with x M2’ = Oand x U 2’ = J, this correlation is 
incomplete. 

Actually, in the case of finite distributive lattices, and more generally with 
arbitrary modular lattices of finite dimensions (the author will publish proofs 
elsewhere, in an article on modular lattices), congruence relations correspond 
one-one to subsets of the set of prime factors. 

It follows that, if L = B* is any finite distributive lattice, the congruence 
relations on it are obtained by setting 2 = y(@) if and only if x and y contain 
the same P;. Hence, to obtain the homomorphic images B” of B*, set Y equal 
to any subset of Y having on that subset the same inclusion relation as X. 

The determination of the sublattices of B* is even easier. First, recall that B* 
is isomorphic with the ring of subsets of X which are ‘closed’? with respect to 
the partial ordering of X, by Qa). But a sublattice is clearly just a subring 

and by Theorem 1 these subrings are the families of sets “closed’’ under 
quasi-orderings p of X such that x 2 y(p) whenever rx = yin X. Hence, to 
obtain the sublattices of B*, strengthen the inclusion relation in X to any quasi- 
ordering and consider the partially ordered set Y obtained from this after ele- 
ments x and y such that ¢ = y and y = x have been identified; B” will be the 
general sublattice of B*. 


HARVARD UNIVERSITY. 


“4 Cf. for instance Gr. C. Moisil, Recherches sur l’algébre de la logique, Annales Sci. de 
l’Univ. de Jassy, vol. 22 (1936), pp. 1-118. 











A REPRESENTATION OF GENERALIZED BOOLEAN RINGS 
By N. H. McCoy anp DEANE MONTGOMERY 


1. Introduction. Stone has recently shown' that every Boolean ring is iso- 
morphic to a ring of subclasses of some class. As Stone himself remarks, there 
is a close relation between the representation of Boolean rings and the theory 
of direct sums of rings. The theorem just stated is clearly equivalent to the 
theorem that every Boolean ring is isomorphic to a subring of a direct sum of 
rings F;.* We present here a simple direct proof of this theorem in a some- 
what more general case. 

A commutative ring &, is said to be a generalized Boolean ring of index p 
(often abbreviated p-ring) if p is a prime and if for every a in R, it is true that 
a” = aand pa = 0. A Boolean ring as defined by Stone is therefore a 2-ring.” 
We show here that a p-ring is isomorphic to a subring of a direct sum of rings F,, 
The interest of this theorem lies partly in its generality and partly in the 
simplicity of the proof, which is based on an exploitation of a device used by 
Alexander and by Alexander and Zippin.* Inasmuch as our proof, like Stone’s, 
demands the existence of certain homomorphisms and inasmuch as we prove 
the existence of these homomorphisms by a method analogous to Stone’s,’ our 
proof makes use of transfinite induction. 


2. Subrings of direct sums. For the theorem given here on subrings of direct 
sums neither of the rings considered need be commutative. 

THEOREM |. A necessary and sufficient condition that a ring R be isomorphic 
tu a subring of a direct sum of rings K is that for every a # 0 in R there is a homo- 
morphism h of R into a subring of K such that h(a) # 0. 

Consider first the necessity of the condition. Assume then that the elements 
of R are functions f defined on a certain set M with values in K.° If f, in R 
is not zero, there is some element m such that fi(m) # 0. We obtain a homo- 
morphism of R into a subring of K by making correspond to any f in #& the 
value f(m). This homomorphism is not zero on f; and therefore satisfies the 
condition of the theorem. 

Received January 26, 1937. 

'M. H. Stone, The theory of representations for Boolean algebras, Transactions of the 
American Mathematical Society, vol. 40 (1936), pp. 37-111. See also Garrett Birkhoff, 
On the combination of subalgebras, Proc. Camb. Phil. Soc., vol. 29 (1933), pp. 441-464. 

* In general, for any prime p, F, denotes the field of integers reduced modulo p. 

* When p = 2, the commutativity and the fact that pa = 0 follow from the assumption 
ar=ags. 

* Annals of Mathematies, vol. 35 (1934), pp. 389-395; vol. 36 (1935), pp. 71-85. 

* Loe. cit., pp. 102-104. 

‘ A direct sum consists of the set of all such functions. 
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Turning now to the sufficiency of the condition, let H denote the set of all 
homomorphisms h of /? into any subring of K. Corresponding to each element a 
of R we define on H the function y, , with values in K, as follows: 


(1) yah) = h(a). 


Since A is a homomorphism, it follows at once that 


(2) Ya. v(h) = h(a + b) = h(a) + h(b) = yalh) + yA), 
and 
(3) Yah) = h(ab) = h(a)h(b) = yalh)ylh). 


Thus the correspondence a — y, is a homomorphism of F into the ring S ol 
functions yz, a C Rk. To prove that this is actually an isomorphism, we need 
to show that the function y, vanishes identically on H only if a = 0. This 
follows almost at once, for we have assumed that if a + 0 there is an A, in H 
such that hi(a) = ya(hi) # 0. Thus R is isomorphic to S, and the proof is 
completed by noting that the ring S is a subring of the ring of all functions on 
H to K, and is therefore a subring of a direct sum of rings K. 

A theorem entirely analogous to Theorem 1 holds for groups or, more generally, 
for abstract algebras as defined by Garrett Birkhoff. 


3. Imbedding theorem. The following theorem is demonstrated for any 
prime in the same way in which Stone demonstrates it for the case p = 2. 

Turorem 2. A p-ring R,, may be imbedded in a p-ring R® which contains a 
unit element. 

Let us denote the clements of F, by 0, 1,---, p — 1. The elements of R* 
will be the pairs (r, 7), where r is in R, and fi isin F,. If (m1, 71) and (re, 2) 
are two pairs of the kind described, their sum is defined to be (7, + re, iy + jie) 
and their product is defined to be (rir2 + ner, + mire, tite). These elements 
form a commutative ring under this definition. Clearly p times any element 
is zero and it can also be verified that any element raised to the p-th power is 
itself. The ring thus formed is the desired ring, for the elements of the form 
(r, 0) form a subring isomorphic to R,, and the element (0, 1) is a unit element. 


4. Finite p-rings. Let / be a p-ring containing a finite number of elements. 
Since every clement a of F satisfies the equation a” = a, it follows at once that 
if a" = 0 for some positive integer n, then a = 0. Thus F does not contain a 
radical and is therefore known to have a unit element and to be a direct sum of 
fields.’ These fields, being subrings of F, are clearly also p-rings. 

It will now be shown that F, is the only field which is a p-ring. Suppose S 
is such a field, the unit element of S being denoted by e. Then S contains a 
field F’, isomorphic to F,, and consisting of the integral multiples of e. Since 


7B. L. van der Waerden, Moderne Algebra, vol. 2, p. 163. 
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ze’ —x=2(¢ — 1):-- [x — (p— 1)] (mod p), 
it follows that each element a of S satisfies the equation 
x” —x = x(x — e)--- [x — (p — Ie] = O. 


Now it is easy to show that a satisfies a unique equation f(x) = 0 of minimum 
degree, with coefficients in F’, and with leading coefficient unity. Further, f(x) 
must be irreducible in F’,, as otherwise S would contain divisors of zero. But 
f(x) divides x” — zx and therefore is merely one of its linear factors, and thus a 
is an element of F’,. These results yield the following theorem: 

TuroreM 3. Every finite p-ring contains a unit element and is a direct sum 
of fields F,. 

The following remarks will be useful in proving the existence of homomor- 
phisms in the next section. Let R, be any p-ring containing a unit element e 
and let a be any element in R,. The ring {a, e} generated by a and e consists 
of all polynomials in a and e. Since a? = a and since pa = 0, this ring is 
finite and since it is a p-ring, it is expressible as a direct sum of fields F,. Thus 
there exists a set of non-zero elements e¢;, ¢:, --+ , ¢, of fa, e} with the following 
properties: 


(4) e@2aTat*** + Cry e = Ci, Cie; = 0 (a = j). 


Svery clement of {a, e} is expressible as a linear combination of the elements e, 
with coefficients in F,. Furthermore the elements e; are linearly independent 
over F,,. We shall call this set a basis of {a, e}. 


5. Existence of homomorphisms. Let FR, be an arbitrary p-ring containing 
a unit element e and let S be a subring of R, which contains ec. If @ is an 
element of R, not in S, denote by S(a) the subring of R, generated by S and a. 
The elements of the ring S(a) are expressible as polynomials in a having coeffi- 
cients in S with degree at most p — 1. Now let e; (¢ = 1, 2, --- , r) be a basis 
of fa, e} as in the preceding section. Each integral power of a is a linear com- 
bination of the e,’s with coefficients in F,, and since e is also such a linear 
combination each element b of S(a) may be written in the form 


(5) b = bie: + doe, +--+ + b-e,, 

the coefficients b; being elements of S. If ¢ = ce: + +--+ + ©, is another 
element of S(a), it follows from (4) that 

(6) b a c= (db; a Ce) — deta oa (b, oe C,)e, : 


be = (dieses + +++ + (b-0,)e, . 
If b = 0, it follows from (5) and (4) that 0 = e;b = be; , and thus 
(7) by; be eve b, = bi be 7s b.(e, tere + e,) = @(. 


We shall now prove the following lemma. 
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Lemma. Let S be a subring of R,, containing the unit ¢ of R, , and let h be a 
given homomorphism S— F,. Then there exists a homomorphism h', S(a) — F, , 
such that under h' the images of elements of S are identical with their images 
under h. 

The symbol P, will be used to represent the direct sum of r rings S, the 
elements of P, being denoted by (hb; , be, --+ , b.), where each 6; is an element 
of S. In like manner C, will be used to represent the direct sum of r rings F’,, 
Let K denote the ideal in P, consisting of those elements (b; , be , +++ , b-) such 
that be: + --- + be, = 0. That K is an ideal follows from (6). 

Now h induces a homomorphism (b; , b2, «+: , b-) — (br, b3, --*, b*) from 
P, to C,, where b; > b: by h. Denote by L the ideal in C, which is the image 
of K under the induced homomorphism. The ideal L can not contain 


(1, 1,-+++,1), for if (b1, be, ---,b-) — (1, 1,---, 1), then bib --- b, # 0, 
and from (7), (bi, be, «++ , 6) can not be in K. Therefore L does not include 
all of C.. Any ideal in C, is made up of elements (a , 72, +--+ , 2), where for a 


certain fixed set of 7’s, x; = 0, and for the remaining 7’s, 2; may take any value 
in F,,. Since LZ is not identical with C,, we may assume that LZ consists of all 
elements (0, +++ ,0, a,°+:,2,), where k > 1, and x. +--+ ,2, are arbitrary 
elements of F,, . 


We now set up the correspondence 
(S) b = bey os eee a b.e — br 


and proceed to show that this is the required homomorphism h’. By (6) this 
will be a homomorphism S(a) — F,, provided the indicated correspondence is 
independent of the representation (5) of any given element b of S(a). If b 


can also be expressed as ¢,¢; + ++: + ¢,e,, it follows that (b: — exer + --- + 
(b, — c,)er = Oand therefore [(b: — c)*,--:, (b- — ¢,)*] = (br — ct, 
eC - cr) is an element of LZ. From the form we have assumed L to have, 


it follows that bf — cf = Oand hence bf = cr. Thus (8) defines a homomor- 
phism S(a)— F,,. If x is any clement of S, then from (8) we find 


r=2re = r(e, +--+ + .e,) > a* 


and the homomorphism h’ coincides with h on S. This completes the proof. 

TueoreM 4. If R, is any p-ring containing a unit element e and if a is any 
non-zero element of R,, , there exists a homomorphism h of R,, into F,, such that 
h(a) ¥ 0. 

Consider first the ring {a, e}. This ring is a direct sum of rings F, and 
therefore a homomorphism h may be defined on it to F, in such a way that 
h(a) # 0. It remains only to extend h to the remainder of R,. If the elements 
of R, not in fa, e} are well ordered, the desired extension may be made, in 
view of the lemma just demonstrated, by transfinite induction.® 


* Cf. Stone, loc. cit., pp. 102-104. 











REPRESENTATION OF GENERALIZED BOOLEAN RINGS 459 


6. Representation of p-rings. We are now in a position to prove our principal 
theorem. 

THeoreM 5. If R, is any p-ring, it is isomorphic to a subring of a direct sum 
of rings F,, . 

By Theorem 2, 2, may be imbedded in a p-ring R’, with a unit clement, and 
by Theorems | and 4, R%, is isomorphic to a subring of a direet sum of rings F,. 
Therefore &,, itself is isomorphic to a subring of a direct sum of rings F,,. 


SsmirH Co.LLece 











A STRUCTURAL CHARACTERIZATION OF PLANAR 
COMBINATORIAL GRAPHS 


By Saunpers Mac Lane 


1. Introduction. There are several known necessary and sufficient conditions 
that a combinatorial graph be planar.’ This paper aims to establish another 
such condition which has a more intrinsic character, in that it is obtained directly 
from an analysis of the structure of the graph. More explicitly, the new con- 
dition depends on a unique decomposition of the graph into certain maximal 
triply connected subgraphs. This decomposition can be viewed on its own 
merits as a generalization of the Whyburn cyclic element theory. 

The first combinatorial criterion for a planar graph is due to Kuratowski,” 
who showed that a graph is planar if and only if it contains no subgraph homeo- 
morphic to one of the two following graphs: the graph composed of five vertices 
and ten edges, in which each pair of vertices is joined by an edge; the graph 
composed of six vertices, arranged in two sets of three vertices each, and of nine 
edges, such that each vertex of the first set is joined to each vertex of the second 
set by an edge. Subsequently, Whitney defined combinatorially the relation 
between a graph and its planar dual and showed that a graph is planar if and 
only if it has a planar dual.’ A third condition states that a combinatorial 
graph is planar if and only if it contains a complete independent set of circuits, 
modulo 2, such that no edge appears in more than two of these circuits.* 

The application of any of these theorems to a particular case has a haphazard 
character because one must investigate any possible smallest non-planar sub- 
graph or any possible dual or any possible complete set of circuits. We seek 
an intrinsic condition; that is, a condition expressible in terms of configurations 
which are associated in a unique manner with a given graph. An example of 
such a condition is the result of Whitney, that any graph G can be broken up 
uniquely into non-separable components” and that the graph is planar if and 
only if each of its non-separable components is planar.’ 

Received January 28, 1937; presented to the American Mathematical Society, December 
30, 1936. 

1 For definitions of terms see §2. 

2K. Kuratowski, Sur le probléme des courbes gauches en topologie, Fundamenta Mathe- 
maticae, vol. 15 (1930), pp. 271-283. 

3H. Whitney, Non-separable and planar graphs, Transactions of the American Mathe- 
matical Society, vol. 34 (1932), pp. 339-362. 

*S. Mae Lane, A combinatorial condition for planar graphs, Fundamenta Mathematicae, 
vol. 28 (1937), pp. 22-32. 

5 It is to be hoped that such a condition may throw light on the question of when a graph 
is mappable on the torus. 

6 These components of G are precisely the true cyclic elements of @. See G. T. Why- 
burn, Concerning the structure of a continuous curve, American Journal of Mathematics, 
vol. 50 (1928), pp. 167-194. For the combinatorial decomposition into components see D. 
Kénig, Theorie der endlichen und unendlichen Graphen, Leipzig, 1936, Ch. 14. 

7H. Whitney, loc. cit., Theorems 12 and 27. 
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These non-separable components are cyclically connected, since they are not 
disconnected by the removal of any one vertex. Similarly, a graph G is triply 
connected if the removal of any two of its vertices either does not disconnect G, 
or else disconnects G into two parts, one of which is only a chain. Some results 
of Adkisson and Whitney* indicate that such a triply connected graph G can 
have but one map on the sphere. In this topologically unique map, the circuits 
which bound the connected regions of the complementary sect are the only 
circuits which do not “cut” the graph; that is, they are the only circuits whose 
removal does not disconnect the graph. The condition that these circuits really 
give possible region boundaries for a map yields (§4) an intrinsic condition that 
such a triply connected graph be planar. 

It is then natural to try to reduce a graph G which is not triply connected to 
triply connected constituents. This might be done by choosing two vertices 
p and q whose removal disconnects the graph G into two subgraphs H and H’. 
However, planar maps of H and H’ cannot always be combined to form a map 
of G. This would be the case if H is modified by adding one new are joining 
p to q. The so-modified graph we call a “block” of G. Any graph can be 
broken up successively (‘split’) into blocks so that the final blocks, called 
xtoms, are triply connected (cf. §2). These atoms are uniquely determined (see 
§3) except fora homeomorphism. In terms of this combinatorial decomposition, 
we obtain our fundamental result that a non-separable graph is planar if and 
only if each of its triply connected atoms is planar (see §5). In the last section 
we further illustrate the applicability of these atoms by showing that the number 
of topologically distinct maps of a planar graph can be directly computed from 
the number of atoms and the number of “multiple” splits of the original graph. 


2. The splitting process. We first state some preliminary definitions. A 
combinatorial graph G consists of a finite number of elements a, 8, --- called 
“edges” and a finite set of “‘vertices’’, p, q, --- , where each edge 8 is “on” 
exactly two distinct vertices p and g, which may be called the “ends” of 8. 
Any set of edges in G together with all the vertices on these edges form themselves 
a subgraph. We write H C G for “H is a subgraph of G”’. If H, and He are 
subgraphs of G, then H, N Hz is the subgraph containing those edges in both 
H, and Hz, while H, + Hz is the subgraph containing those edges in either 
H, or Hz, and G — H, is the subgraph containing all edges of G not in H, . 
“Circuits” and “chains” are defined as usual. A hanging chain in G is a chain 
none of whose vertices, except perhaps its two ends, are on more than two edges 
of G. Two graphs G and G’ are called homeomorphic if and only if @ ean be 
changed into G’ by one or more of the following operations: 


8V. W. Adkisson, Cyclicly connecled continuous curves whose complementary domain 
boundaries are homeomorphic, preserving branch points, Comptes Rendus des séances de la 
Société des Sciences et des Lettres de Varsovie, Classe ITT, vol. 23, pp. 164-193. 

H. Whitney, Congruent graphs and the connectivity of graphs, American Journal of Mathe- 
matics, vol. 54 (1932), pp. 150-168. 
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(i) The replacement of a hanging chain by a new hanging chain having the 
same ends; 

(ii) The renaming of edges or vertices or both.’ 

If no vertices are renamed in the homeomorphism, each branch vertex of G is a 
branch vertex of G’, and so we say that the branch points are preserved. Two 
homeomorphic graphs are topologically equivalent, in that any map of the one 
is topologically homeomorphic to any map of the other. 

A graph B consisting only of t 2 3 edges all having the same two ends p and 
q, or any graph obtained from B by replacing these edges by hanging chains, 
will be called a branch graph with ¢t branches and with the termini p and q. 

Because of Whitney’s results on separable graphs, stated in the introduction, 
we shall consider henceforth only non-separable graphs G. A graph G, not a 
single edge, is non-separable (cyclically connected) if and only if each pair of 
vertices of G is contained in a circuit of G.”° 

A semi-split of G at the vertices h; and he is a representation of G as a sum 


(1) G=H+H’, 


where H and H’ are two non-void subgraphs, having in common no ares and 
no vertices except the vertices h; and hz. A split of G at h; and he is a representa- 
tion (1) which is a semi-split and where neither H nor H’ is a chain. Since G is 
non-separable, 7 must contain both i; and he , and must be connected. Hence 
there is a chain in H with ends A; and he , and in like manner, a chain in H’ with 
the same ends. 

Take any chains X and X’ contained in H’ and # respectively, and having 
the ends h; and he. The two subgraphs 


(2) H + X, H’ + X’; ac, X’ CH 


will be called blocks'' of G corresponding to the split (1). We say that G@ has 
been split into these two blocks. These blocks are not uniquely determined, 
for X may be replaced by other chains from H’ with the same ends. However, 
any such X is a hanging chain in the block H + X, so that this block is uniquely 
determined up to a homeomorphism. 

A graph G will be called triply connected” if it is cyclically connected, if it 


*H. Whitney, On the classification of graphs, American Journal of Mathematies, vol. 55 
(1933), pp. 236-244. 

10 Whitney, Non-separable graphs, loc. cit., Theorem 7. Alternatively, a graph is non- 
separable if it is not disconnected by the removal of any one vertex. 

"Since H + X is obtained by replacing all ares of H’ by a single chain X, we may con- 
sider this block H + X as a sort of ‘‘factor-graph’’ of G modulo H’. This analogy with 
factor-groups is suggested because the group of cycles of JJ + X (mod 2) is isomorphic to 
the faector-group of the group of eycles of G modulo the group of cycles of H’, both taken 
mod 2. 


' This notion of triply connected (call it TCM) does not always agree with the notion 
of triply connected (TCW) introduced by Whitney, in Congruent graphs, loc. cit., p. 158. 
The two definitions agree for graphs G containing at least four vertices and containing 
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has no split, and if G is neither a circuit nor a branch graph. A cyclically con- 
nected graph @ split into two blocks may be further decomposed by splitting 
one of these blocks which may happen not to be triply connected. Such sue- 
cessive splits finally yield a set of unsplittable blocks. By the definition of triple 
connectivity, each unsplittable block is either a branch graph or a triply con- 
nected graph. The branch graphs are unimportant. The triply connected 
blocks we call atoms, and all the triply connected blocks in a final set of un- 
splittable blocks will be said to constitute a complete set of atoms of G. 

We now list two useful consequences of our definitions. 

Lemma 1. If in the graph G there is a chain L which does not pass through” 
either verter hy or hz , then L is contained in one of the blocks of any split at the 
vertices hy and he . 

The proof follows at once from the definition of a split. 

Lemma 2. A block of a cyclically connected graph is always cyclically connected. 

Proof. By hypothesis any two vertices p and gq in the block H + X of (2) 
lie together on a circuit C in G. If C does not lie entirely within the block 
H + X, then one piece of C between h; and hz can be replaced by the chain X, 
giving a new circuit C* in the block and connecting the vertices p and q. 


3. A unique characterization of atoms. This section will give a combinatorial 
proof that the atoms of a graph are unique, up to a homeomorphism. This 
will be done by giving an invariant characterization of these atoms as maximal 
triply connected subgraphs of G. Here a subgraph 7 of G is maximal triply 
connected (max. trip. conn.) if 7 is contained in no other triply connected sub- 
graph W # T. 

THEeoreM |. Every atom of G is a maximal triply connected subgraph. If 


(3) A,, Ae, -+:,Am 


is a conrplete set of atoms of G, then every maximal triply connected subgraph of G 
is homeomorphic, preserving branch points, to one and only one of the atoms (3). 

Proof. If T is any trip. conn. subgraph of G, then in the split (1), one of the 
blocks contains a trip. conn. subgraph 7* homeomorphic, preserving branch 
points, to 7. To show this, use the equation 


(4) T=(TNA+(TNA, 


where 7M H and 7 —M H’ have in common no ares and only the vertices A; and 
hg. As T is trip. conn., this cannot be a split, and one of the subgraphs, say 
T 1-H’, is void or a single chain. If it is void, then 7 C H, so that 7 itself is 


neither a circuit £, consisting of only two edges, nor a vertex p, on only two edges. A 
graph containing a circuit Z is never TCM, but may be TCW. A graph containing a ver- 
tex p, as above, is never TCW, but may be TCM. The definition TCM used above has the 
advantage that it is invariant under any homeomorphism of the graph. 

'? A chain L does not pass through / if A is not in L or is only an endpoint of L. 
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in one of the blocks. On the other hand, if 7 NM H’ is a single chain Y, this 
chain must have the ends h; and he , so that 
(5) T*=(T-Y)+X (Y=TN H’) 
is a new subgraph homeomorphic to 7, preserving branch points, and 7* is 
contained in the block H + X, as required. 

To show that any trip. conn. atom A is max. trip. conn., suppose instead that 
A CT,A # T holds for a trip. conn. T with more edges than A. If we make 
one of the splits (1), leading up to the construction of the atom A, then A belongs 
to one of the blocks (2), say to the block H + X. But 7 D A, so that T con- 
tains at least a circuit of H, and the 7* constructed above from T must certainly 
be in the same block H + NX with A. Because 7’ D A, and because T* is ob- 
tained from T by changing at most one chain not in A, we must have T* D A. 
Therefore, the atom A is contained in a larger trip. conn. subgraph 7* of the 
block H + X. We repeat this argument, getting A in successively smaller 
blocks until A is itself a block contained in a larger trip. conn. subgraph 7; , 
which in turn is contained in the block A. This is a manifest impossibility. 
Hence every atom is max. trip. conn. 

Consider any max. trip. conn. subgraph 7’. We know that it is homeo- 
morphic, preserving branch points, to a 7* in the block H + X. This 7* is 
max. trip. conn. in this block. This is obvious if 7* = T. Otherwise T + T*, 
and we have Y # X in the construction (5) of 7*. Were 7* contained in a 
larger trip. conn. subgraph W in this block, then W, perhaps modified by re- 
placing the chain X by the chain Y from (5), would be a trip. conn. subgraph 
properly containing 7’, contrary to the assumption that 7 is maximal. As a 
result 7* is a max. trip. conn. subgraph homeomorphic to 7 and contained in 
the block H + X. Continuing this, we finally obtain a max. trip. conn. sub- 
graph 7, homeomorphic, preserving branch points, to the original JT and con- 
tained in a smallest block A. As this A contains the triply connected 7’, , it 
cannot be a branch graph, and so must itself be trip. conn. The maximal 7°, 
must be all of A, so that the original 7’ is homeomorphic to the atom A = 7; . 
It can be homeomorphic, preserving branch points, to no other atom, for no 
two blocks and hence no two atoms can contain the same branch points. The 
theorem is thus established. It states that a set (3) of atoms contains all max. 
trip. conn. subgraphs of G, except for certain homeomorphisms. Hence we deduce 

TueoreM 2. If a cyclically connected graph G can be split up in two ways to 
give two complete sets of atoms, then there exists a one-to-one correspondence between 
these two sets of atoms, so that corresponding atoms are homeomorphic, preserving 
branch points. 

This theorem could also be proved by a direct consideration of two given splits 
by a method similar to that of the Jordan-Hélder theorem; that is, by first 
showing that any two given original splits of G can be subdivided to give homeo- 
morphic results. 


' Exeept in the trivial ease when one block is a branch graph. 
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The characterization of atoms given above can be made independent of the 
notion of a split by means of an independent description of triply connected 
graphs, based on the result of Whitney that the chief characteristic of a triply 
connected graph is the fact that any two of its branch points are the termini 
of three independent ares. These independent ares form a @-subgraph, where 
a “@é-graph” means a branch graph with three branches. 

TureoreM 3. A cyclically connected graph G is triply connected if and only if 

(i) for each pair of branch vertices p and q of G there is a 6-subgraph W of G 
with termini p and q, and 

(ii) G contains no circuits passing through less than three branch points. 

Proof. Suppose first that conditions (i) and (ii) hold while G still has a split 
(1). Since G is not a single chain, and since by (ii) H can not be a branch 
graph, 1 must contain a branch vertex different from both h; and h,.. Similarly, 
H’ contains a branch vertex q distinct from h; and h,.. By (i) there is a 6-graph 
with termini p and q, and each of the three independent arcs of this 6-graph 
must pass through one of the two vertices h; or h, to go from p to g. Thus two 
of these ares intersect, a contradiction. 

Conversely, suppose that G is trip. conn. and suppose, contrary to (ii), that 
G bas a circuit C with only two branch vertices h; and h:. Then G = C + 
(G — C) would be a semi-split of G at hy and he. This semi-split must be a split, 
because were G — C a single chain, G would be merely a 6-graph, contrary to 
the definition of triple connectivity. 

To prove (i) for a trip. conn. G, first modify the graph by replacing each 
maximal hanging chain of G by a single edge. The resulting graph contains 
no vertices not branch vertices, is homeomorphic to G, and so is still trip. conn. 
One argues readily that it is also trip. conn. in the slightly different sense of this 
term used by Whitney.” Then by Whitney’s result” it follows that any two 
vertices of G are joined by three distinct chains, as asserted. 


4. Maps of triply connected graphs. A map o of « combinatorial graph G 
is 2 Correspondence which assigns to each vertex p of G a point ep on the sphere 
and to each edge a of Ga Jordan are o(a) on the sphere, such that the ends of 
o(«) are the maps of the ends of a, while op ¥ og if p ¥ q and two ares o(a) 
and ¢(8) with a # 8 do not intersect, except perhaps at their end points. Any 
subgraph E of G has an image o(E£) composed of all those ares o(a) with @ in EF. 
‘Two maps o(@) and 7(G) of G on the sphere will be considered as identical if 
there exists a topological transformation of the sphere carrying ¢(@) into r(@) 
and each (a) and o(p) into the corresponding r(@) or r(p). 

If G is non-separable, any map o(@) cuts the sphere into a number of con- 
nected domains whose boundaries are the maps of certain circuits of G. These 
circuits we call the complementary domain boundaries (c. d. boundaries) of the 
map. We state without proof the following 


* HH. Whitney, Congruent graplis, loc. cit., p. 158. 
H.Whitney, ibid., Theorem 7, p. 160. 
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THroreM 4. Two maps of a cyclically connected graph on the sphere are topo- 
logically equivalent if and only if they have the same set of complementary domain 
. 
boundaries. 
The possible ¢. d. boundaries of a non-separable graph can be characterized 
. . . ° . 1s 
in the following fashion: 


THeoreM 5. A set of circuits C, , --- Cy, in a non-separable graph G is th 
set of complementary domain boundaries of a planar map of G if and only if each 
edge of G is contained in exactly two of the circuits C, while the circuits Cy, -+- . Cu 


form a complete independent set’ of circuits in G, mod 2. 

Adkisson has shown” that if a cyclically connected graph G has a map in 
which each pair of ¢. d. boundaries has a connected or void intersection, then 
every homeomorphism of @ to itself can be extended to the sphere. This sug- 
gests that any two maps of such a G on the sphere are identical. But the inter- 
section of any pair of c. d. boundaries will be connected or void if and only if 
the graph G can not be split. This agrees with a theorem of Whitney,” which 
states that a triply connected graph has at most one dual (and hence at most one 
map on the sphere). We shall now find an intrinsic characterization of the 
unique map on the sphere of a triply connected graph. 

A circuit C is a cut circuit in the triply connected graph G if there are two non- 
void subgraphs H and H’ of G such that G — C’ = H + WH’, while H and I’ 
have in common only vertices on C. 

TueroreM 6. If G is a triply connected graph with a planar map o(G), then a 
circuit C & G is a complementary domain boundary of o(G@) if and only if C is 
not a cut circuit of G. 

Proof. Any C not ac. d. boundary of @ certainly cuts G into the two non- 
void pieces located respectively within and without the closed curve o(C€). 
Conversely, let C be a c. d. boundary of o(G) and suppose, contrary to the 
theorem, that C is a cut circuit of G. Then oG — oC is not connected, and 
so has m = 2 connected pieces E,, E2,---,E,. Each closure E; consists of 
E; plus end points of ares, and so is the map of a corresponding subgraph G; of @. 


'7 As the number of domain boundaries is finite, this theorem may be readily proved by 
mathematical induction from the fact that a topological correspondence between any two 
closed Jordan curves can be extended to the interiors of these curves. The induction 
could depend on Mac Lane, loc. cit., Theorem 3.1. The theorem also follows by the method 
used by V. W. Adkisson, loc. cit., Theorem 2; it is based on a theorem due to H. M. Gehman, 
On extending a continuous one-to-one correspondence of two plane continuous curves to a cor- 
respondence of their planes, Transactions of the American Mathematical Society, vol. 28 
(1926), pp. 252-265. 

'® This is a re-formulation of the condition quoted in the introduction, using Mac Lane, 
loc. cit., Lemma 4.1, Theorem 5.1, and Theorem 5.3. 

1° The second condition is equivalent to requiring that Ci, --- , Ca. are independent 
modulo 2 and that m — 1 is the nullity of @. The nullity of Gis E(G) — V(@) + P(@), 
where E(G@), V(G), and P(G@) are respectively the number of edges, the number of vertices, 
and the number of connected pieces of G. 

20 V. W. Adkisson, loc. cit., Theorem 3, p. 168. 

“1H. Whitney, Congruent graphs, Theorem 11. 











STRUCTURE OF PLANAR COMBINATORIAL GRAPHS 467 


These G; are the smallest sets into which C cuts G. Those vertices of each G, 
which lie on C we eall the feet of G; . 

Kach G; has at least three feet. For if G; had no foot on C, G would be dis 
connected; if G; had one foot on C, G would be separable at this foot, while 1 
Gi; had two distinct feet, G could be split at these two feet into G; plus the re 
maining part of G, unless G; were a single chain. If G; is a single chain with 
two feet on C, this chain divides the exterior of C into two regions. The two 
subgraphs of G contained in the closures of these two regions, respectively. 
intersect only in the two feet of G;, so that G is again split. Each of these 
results contradicts the hypothesis that G is triply connected. 

Let p, and ps be two feet of G,. Because FE, is connected, p, and pe can be 
joined by a chain L inG,. Any set E; # FE, contains no points of o/. or of the 
c. d. boundary oC, so that #; must lie entirely within one of the two regions 
bounded by of and an are of oC. Therefore, the feet of each G; # G, all lie 
on one of the two ares into which p; and pe divide C. 

We can choose the feet p, and pe of G, so that one of the ares C; into which 
they divide C contains all the feet of G, , but no feet of G, except for p,; and pe . 
Then G ean be split. For let H denote the subgraph of G composed of C; and 
all those subgraphs G; whose feet lie only on C;, while H’ consists of C — C; 
and all those G; whose feet all lie on C — C,. By the previous result, every 
G; belongs to exactly one of these subgraphs, so that @ = H + H’, where H 
and H’ have only p, and pein common. As G, has at least three feet, G, is in 
H’, while G, is in H, so that neither H nor H’ is a single chain. Therefore this 
is a split, contrary to the triple connectivity of G. Hence the e. d. boundaries 
‘an not cut G, as asserted in the theorem. 

THroreM 7. A triply connected graph can not have two topologically distinct 
maps on the sphere. 

Proof. If there is a map, its ¢. d. boundaries are exactly the circuits which 
do not cut G. This property is independent of the particular map, so that there 
ean be only one set of ¢. d. boundaries and hence by Theorem 4 at most one map 
on the sphere. Another immediate result is the following criterion for map- 
pability : 

THEOREM 8. A triply connected graph G can be mapped on the sphere or on the 
plane if and only if the circuits of G which do not cut G form a set of circuits satisfy- 
ing the condition of Theorem 5 for a set of c. d. boundaries. 


5. Maps of cyclically connected graphs. The final theorem on mappability is 

THeoreM 9. A cyclically connected graph G can be mapped on the plane if 
and only if all its atoms can be mapped on the plane; that is, if and only if all its 
triply connected atoms satisfy the mappability criterion of Theorem 8. 

This theorem will follow by induction on the number of atoms once we show 
that a graph G split into two blocks has a map whenever both blocks have 
maps. This fact we state in the following more explicit form. 

LemMa 3. /f G ts split as in (2), and if + and r’ are maps on the sphere of the 
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blocks H + X and H’ + X’ respectively, where in + the chain X appears in two c. d. 
boundaries C and D, while X’ appears in 7’ in the boundaries C’ and D’, then there 
is a map of G in which the c. d. boundaries are the c. d. boundaries of + and 7’, 
except that the circuits C, D, C’, and D’ are replaced by 


(6) (C — X)+ (C’ — X’), (D — X) + (D’ — X’). 


Proof. Because the chains C — X and C’ — X’ belong to H and H’ respec- 
tively, they have no ares in common, while by construction they both have 
the two vertices h; and he as ends. Hence (C — X) + (C’ — X’) and likewise 
the other graph in (6) is actually a circuit in G. Once the lemma is established, 
a simple change of notation will give a similar result with (6) replaced by 


(7) Cc — X)+ — Xx, (D — X) + (C’ — X%. 


To prove the lemma we need only squeeze the map 7(H’) into the region 
of the map 7(//) originally occupied by 7(X). This may be done as follows. 

Draw an are r(Z) with ends r(A;) and r(he) in the region bounded by C and 
draw a similar are 7(Z’) inside C’. In these altered maps the c. d. boundaries 
are as before except that C and C’ are replaced by 


(8) Z+(C—X), Z24+4K, Z' + (C’ — X’), 2+ NX’, 


respectively. One of the two regions of the sphere bounded by X + Z contains 
the rest of 7(H7). Call this region the “outside” of X + Z, with a similar con- 
vention as to X’ + Z’. Then map the outside and boundary of X’ + Z’ topo- 
logically on the inside and boundary of X + Z in such a fashion that r’(A,) and 
(he) go into r(/,) and r(he) respectively, while r’(X’) goes into 7(X) and 
7'(Z’) into 7(Z). The two maps so combined have e. d. boundaries which 
are the original c. d. boundaries of 7 and 7’ except that C, C’, D, and D’ are 
replaced by 


9%) Z2+(C-X), 2+(C' -—X), X+(D—-—X), X + (D — X’). 


If we remove the maps of X and Z, a map of G with the required c. d. boundaries 
will remain. The lemma being established, the theorem follows. 


6. The number of maps of a graph. In this section we denote by u(@) the 
number of topologically distinct maps of a graph G on a sphere. If one map of 
G is given, other maps may be found by “rotating” one of the components of a 
split of G or by permuting several components which are connected in “‘parallel’’. 
To show that all distinct maps of G can be obtained by sequences of such opera- 
tions, we first discuss such components in “‘parallel’’. Given two vertices p and 
q of G, we say that two edges @ and 8 of G are connected outside of p and q if there 
is a chain L of G containing @ and 8 but passing through neither p nor gq. The 
relation, ‘a is connected to 8 outside p and q’, is reflexive, symmetric, and 
transitive. Therefore G is subdivided into disjoint subgraphs M, , Me, --- , M; 
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such that two edges of G belong to the same subgraph if and only if they are 
connected outside p and g. Consequently, 


(10) G= M,+ M.+ hig + M, 


is a representation of G in which any two of the subgraphs M; and M; have 
only the vertices p and gincommon. Furthermore, no subgraph M; has a semi- 
split at p and q, so that the representation (10) can not be further subdivided. 
If t > 2 in (10), we say that G@ has a multiple split of order t at p and q. 

TuroreM 10. If G can be mapped on the sphere, the number u(G) of topologically 
distinct maps of G on the sphere is 


k 
(11) wG) = 2" TT e, — 1), 
i=1 


where a = a(G@) is the number of atoms in a complete set of atoms of G, where G has 
k distinct multiple splits, and where the i-th multiple split has the order t; . 

We shall establish this theorem by constructing the atoms from “least” 
splits of G. A component H in a split (1) of @ at hy and he will be called least 
if the component 7 has no semi-split at A; and he ; that is, if any two ares of 7 
are connected outside h; and he. <A split (1) of @ will itself be called least if 
either of the components H or H’ of the split is least. 

Lemma 4. Let (1) be a least split of G into H and H’. Then in any map ¢ of G 
there are two c. d. boundaries which have edges in both H and H', while any other 
c. d. boundary is entirely in H or entirely in H'. 

Proof. Edges of both H and H’ abut on the split point h;. The cyclic order 
of the complementary domains about h; indicates that through this point there 
must pass at least two c. d. boundaries which have edges in both H and H’. 

Suppose there were three (or more) such boundaries with edges in both // 
and H’. If the three complementary domains with these boundaries are re- 
moved, the remaining part of the sphere falls into three parts which touch only 
at hy and h.. Since H has edges in all three c. d. boundaries, it follows readily 
that H must have edges in at least two of the three parts of the sphere. As 
these two parts of H touch only in A; and hz , H has a semi-split at these vertices. 
H’ likewise has a semi-split, contrary to the assumption that one of H and I’ 
is least. Therefore there are but two ec. d. boundaries of the sort considered. 

Lemma 5. If G has a least split (2), there exists a 2-1 correspondence between 
the maps o of G on the sphere and the pairs of maps (7, r'), where + and 7’ are maps 
on the sphere of the blocks H + X and H’ + X’ respectively. 

Proof. A given map o of the whole graph G is also a map of each of the sub- 
graphs (or blocks) H + X and H’ + X’, and hence does correspond to a pair of 
maps of these blocks. Specifically, let ¢ be determined (Theorem 4) by its ¢. d. 
boundaries 

, , 


(12) Ci, Ce, ---,Ce, Ci, °°: , Ca, L + L’, M + M’, 


ys ° ° ° ° yf © ‘ ° . 
where each C; is a circuit contained in H, and each C; is contained in H’, while, 
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as in the last lemma, the two circuits with edges in both H and H’ consist of 
chains L and M in H and L’ and M’ in H’. The sub-map of H + X arises by 
dropping from the map (12) all edges of H’ — X. The first circuits C; to C, 
must thus remain as ¢. d. boundaries in the map of H + X, while by Theorem 5 
there must be two other c. d. boundaries containing all of Z and all of M respec- 
tively. Furthermore, X must be in two boundaries, so that the additional 
boundaries are just XY + Land X + M. This gives a map of H + X with the 
c. d. boundaries 


(13) Ci,Ce,-->, Ck, L+ X, M + X, 


and similarly a map of the other block H’ + NX’ with the boundaries~ 
(14) Ci,Cz,--+,Cn, L'+X', M'+X’. 

Therewith we have a correspondence between the map (12) and the “pair of 
maps” (13) and (14). Conversely, given any two maps (13) and (14) of the 
two blocks, there can be at most two maps of G corresponding to (13) and (14) 
in this fashion; namely, the maps given by the c. d. boundaries in (12) or in 


(15) fo. &. C6, £407, Bel. 


Both of these maps (12) and (15) are geometrically possible by Lemma 3. 
Hence the correspondence is a 2-1 correspondence, as asserted in Lemma 5. 

LemMa 6. If G has a least split (1), then, for any multiple split of G of order 
tat p and q, one of the blocks (2) of the least split has a multiple split of order t at 
p and q, while the other block has no multiple split at p and q. The splits so ob- 
tained are the only multiple splits of the blocks. 

The second statement is simple, for if one of the blocks has a multiple split 
at the vertices r and s, then, by the definition of a split in (10), the original 
graph G also has a multiple split at rand s. The proof of the first part depends 
essentially on the fact that one of the components, say H/, of the given split (1) 
is least. 

Casel. p=h,andq = he. As H is least, it must be one of the M; in the 
given multiple split (10) of G, say M,. Then 


H’ = Mz+---+ Mi, 


so that the other block H’ + X’ has a multiple split like (10) with M, replaced 
by Y’, and this multiple split has the order ¢. 

Case 2. Neither H nor H’ contains both p and q, so that one vertex, say p, 
is in 7 but not H’, while q is in 17’, but not in H. But in the multiple split (10) 
each M,; must, because @ is cyclically connected, contain a chain with ends 
p and q, and this chain must pass through one of the split vertices h, or he of the 
split (1). Since each of these split vertices is in but one of the M’s, there can 


* This could also be proved purely combinatorially by showing that if the set of circuits 
(12) satisfies Theorem 5, the two sets (13) and (14) also satisfy Theorem 5. 
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thus be only two subgraphs M;, and the presumed decomposition (10) is no 
honest multiple split. 

There remains the case when p and q are both in one component, say the 
component H of (1), while one of the split vertices, say h; , is at neither p nor q. 
Then any are of H’ is connected to this h; by a chain not passing through either 
p or q, so that all of H’ is contained in one of the pieces M; of the given multiple 
split (10) of G. Thus, when all these edges of H’ are replaced simply by X’, 
(10) gives a multiple split of the block H + X with the same order t, while 
the other block H’ + X’ can not contain both p and q as branch vertices and so 
certainly has no multiple split at p and q. 

In all cases, the given multiple split yields a similar multiple split of just one 
of the blocks (2), so that the lemma holds. 

LemMa7. If agraph G has no least split, then G is triply connected or is a branch 
graph or a circuit. 

For if G is not triply connected, it must have a split at some pair of vertices 
(p, q), and hence a representation (10). There must be t 2 3 terms, because 
the split is not least. If one of the subgraphs M; were not a single are, then 
G = M; + (G — M,) would be a split in which M; is least. Such a least split 
is impossible by hypothesis, so each M; is a single arc, and G is a branch graph. 

To prove Theorem 10, decompose G by successive least splits until none of the 
resulting blocks have least splits. By Lemma 7 these blocks are either branch 
graphs or else are triply connected and hence atoms of G, so that we have a set 
of blocks 


(16) A,,As,°*>,Ae;, B,, Be, -:-, Bs, 


where the a graphs A; form a complete set of atoms, while the B; are branch 
graphs. By Lemma 6 each multiple split of G corresponds to one and just one 
multiple split with the same order in one of the blocks (16). Of these blocks, 
only the branch graphs B; have multiple splits, while G has multiple splits of 
order t;, 7 = 1, +--+ ,k. Hence the branch graphs in (16) can be so arranged 
that B; is a branch graph with ¢; branches. There are 8 = k such graphs. 

In the set of subgraphs (16) each triply connected atom A; has but one map 
on the sphere, while each branch graph B; with ¢; branches has (¢; — 1)!/2 differ- 
ent maps (this is the number of ways of arranging the ¢, branches in cyclic order). 
We combine the maps of (16) to get maps of G. Each step in the combination 
will by Lemma 5 yield two alternative maps of G, while there are a + k — | 
combinations in all. Hence there are 


oath II (ti “2 I)! _ get II -— 0! 


i=] i=l 


different maps of G, as asserted in the theorem. 
Any planar map of G can be obtained by stereographically projecting a map 
from the sphere onto the plane. Distinet maps are obtained when the north 
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pole is chosen in distinct regions, and the number of such regions is simply 
N(G) + 1; therefore we have the 

CoroLiary. The number of topologically distinct maps of a cyclically connected 
planar graph on the plane is 


(G) = [N(G) + 1G) = ING + 12" IL @ — vt, 
i=l 


where N(G) is the nullity of G and the other constants are given as in the theorem. 


CORNELL UNIVERSITY. 

















CRITICAL CURVATURES IN RIEMANNIAN SPACES 
By ArtHur B. Brown 


1. Introduction. A well known theorem in differential geometry concerns the 
normal curvatures of curves through a point on a 2-dimensional surface in 
3-space. It states that either the curvature is constant, independent of the 
direction of the curve, or else there is one direction giving a maximum to the 
curvature, and another (perpendicular) direction giving a minimum. We gen- 
eralize this result to the case of an n-surface in a Riemannian (n + 1)-space. 
In place of merely a maximum and a minimum, there is in general a non- 
degenerate critical point of each type or index’ 0, 1, ---, mm — 1. A similar 
result is obtained for an arbitrary subspace of a Riemannian space, the theorem 
being stated in terms of projections on any direction orthogonal to the sub- 
space. A final theorem, with a similar statement regarding critical values, con- 
cerns the Ricci mean curvature in a Riemannian space. 


2. The principal directions’ for a real quadratic form. Our results regarding 
critical values will be based on the following theorem. 
THeoreM 2.1. Given the real quadratic form® 


(2.1) 2 = 4;;2;2;, 


on the locus 
(2.2) 22%; = 1 


2 has at most, and in general exactly, n distinct critical values. When the number 
is n, the critical values are taken on at n pairs of diametrically opposite points of 
(2.2), determining n mutually perpendicular lines through the origin in the number 
space of the x’s. If the pairs are ordered according to the algebraic values of z, 
at either point of the i-th pair z has a non-degenerate critical point of index i — |. 

Proof. We begin by making an orthogonal transformation with fixed origin 
in (x)-space so that the given form becomes (using the same letters 7, --- . 2, 
(2.3) 2=bait+---+b,2, = bz; 
with the b’s real.’ Now if we consider the function 

Received February 24, 1937; presented te the American Mathematical Society, March 
26, 1937. 

' Marston Morse, The Calculus of Variations in the Large, Amer. Math. Soc. Colloquium 
Publications, vol. 18, p. 143. 

Cf. L. P. Eisenhart, Riemannian Geometry, Princeton, 1926, p. 110. We shall refer to 
this volume as Eisenhart. 

* Repetition of an index indicates summation from | to n. 

‘Cf. M. Bécher, /ntroduction to Higher Algebra, p. 170, Theorem 1 and p. 171, Theorem 2; 
or L. E. Diekson, Modern Algebraic Theories, p. 74, Theorem 10. 
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(2.4) w = b:xi/z;2;, 


we see that w = z on locus (2.2). Using the fact that w is constant along 
each line directed towards the origin, omitting the origin itself, we easily see 
that those of the critical points of w, as a function of n independent variables, 
which are located on (2.2), are the same points as the critical points of z on (2.2) 
as a function of n — 1 independent variables. 

Then to find the critical points, using (2.4) we set 


_ dw _ (xj2;)(2b. a4) — (b; x7) (22,) 


0 — = 
Or, (x;2;)* 


(do not sum s). 


Hence the critical points of z on (2.2) are the points on (2.2) where 

(2.5) b,x, = (b;277)z, (s = 1, +--+ , n; do not sum s). 
Obviously 

(2.6) 2; = bn; (Gj =1,---,n)s 


satisfy (2.2) and (2.5) for any fixed k. Taking k = 1, 2,--- , m, we have n 
solutions of (2.5) and (2.2). 

We shall now show that (a) the number of distinct critical values of z on (2.2 
equals the number of distinct b’s in (2.3), and (8) if that number is n, then 


(2.6) gives the only critical points other than those with 7; = —6;;. 
Suppose 
(2.7) by = be =—=--s-e = b, ‘ 


but no other b = b,. If we take a solution of (2.2) and (2.5) with one or more 
of x, %2,°:*: , x, different from zero, say x, + 0, then if, say, 2,44 + 0, from 
(2.5) with s = lands = r + 1 we would have 


bz; = db, = brs, 
contrary to hypothesis. Hence, for the solution in question, 
(2.8) O = fui = °° = In 
and hence 
(2.9) l=a7i+---+2%. 


From (2.7), (2.8), (2.9) and (2.3) we now see that the critical value in question 
is bh} = bo = --- = b,. Since a similar argument holds for each set of equal 
b’s, our assertion (a) follows at once. 

If we have n distinct critical values, then by (@) the number of distinct b’s 
must be n, and hence for each critical point only one of the z’s can be different 
from zero (ef. (2.8)). Thus (8) is established. The perpendicularity of the 
directions also follows. 


6; = Oifk #j, = 1ifk =j. 














CRITICAL CURVATURES IN RIEMANNIAN SPACES 475 


Consider the solution (2.6) with k = 1. It is (1, 0,--- , 0). At this point 
ty, °** ,2, can be taken as the independent variables for (2.2).° Substituting 
for z; from (2.2) into (2.3), we have 


(2.10) 2 = bl — a3 — +++ — 2h) + bead + +++ + O27 
= bh + (bo — b;)x3 + (bs a b:)x3 +++ + (b,, = b,x, . 


Hence the index of the critical point (the number of negative coefficients) is 
the number of b’s less than b; . 

Since a similar result can be obtained by using (2.6) with cach of the values 
k = 2, 3,--- , n, we infer that if, say, bi < be < --- < b, , then, for each k, 


«; = 6; gives us a critical point of index k — 1, at which z = b,. We would 
obtam the same result by taking 7; = —&j,;. The truth of the theorem is now 
established. . 


3. Riemannian codrdinates. In this section we establish Riemannian coér- 
dinates’ for a Riemannian space without assuming that the coefficients of the 
fundamental form are analytic. If one is satisfied with the case that the coeffi- 
cients are analytic, this section may be omitted. 

THEOREM 3.1. If the fundamental form of a Riemannian space has coefficients 
of class C*, k = 4, neighboring a point P with coérdinates (a), a Riemannian 
coordinate system can be introduced, for a neighborhood of P, with origin at P, 
by a non-singular transformation of coérdinates in terms of functions of class C*™*.’ 

Proof. If we take 2), +--+ ,2, as the coérdinates, and g,;;dx,;dx; as the fun- 
damental form, the geodesics are the solutions of 


* The index of the critical point is independent of the particular parameters chosen, 
providing z is a function of class C? of those parameters. 

* A system of codrdinates y, ,--- , yx in a Riemannian n-space is called Riemannian 
if the geodesics through the origin are the curves given by the equations y; = X(t 
(i = 1, +--+, mj A, +--+, * any real constants not all zero). The theorem of this section 
has been proved by J. H. C. Whitehead, On the covering of a complete space by geodesics 
through a point, Annals of Math., vol. 36 (1935), pp. 679-704. Cf. footnote 5 of T. Y. 
Thomas, On normal coérdinates, Proce. Nat. Acad. Sci., vol. 22 (1936), pp. 309-312, where 
a proof of the theorem is based on results in an earlier paper by W. Mayer and T. Y. Thomas, 
Math. Zeitschrift, vol. 40 (1936), pp. 658-661. The proof was also given in some mimeo- 
graphed notes of W. Mayer at Princeton in 1936, for a more general variational problem. 
We are indebted to G. Comenetz for the above information. Since a simple proof has 
not yet been published all in the same paper, we think it advantageous to give one here. In 
the papers cited above, the hypotheses are equivalent to taking k = 2 in Theorem 3.1 in- 
stead of k 2 4. We assume k 2 4 to insure that the equations of the geodesics in the 
new coérdinate system have coefficients of class C'. 

* A function is of class C* if it is continuous and has all its partial derivatives, up to 
and including those of order k, continuous. 

* While we shall be dealing with the case that the fundamental form is positive definite, 
we do not make this assumption in this theorem. 
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dx; {ik\ dx; dx 


1) de + \if a dt 


= 0 (§ =1,-+-,n). 


Since {} is of class C’', the solution with 2; = 2? and (dx;/dt) = (dz, dt)o 
at t = t is given, according to a theorem of differential equations, by 

(3.2) 2: = odt, to, 21, °°* . 2, (dri/dt)y, --+ , (dx,/dt)o] 

oilt, ty, x°, (dx/dt)o] (¢ = 1,---,n), 


1 


Il 


with ¢; of class C*" in all the arguments," say for |t| <6 |b. < «6 
| xy —a;| < «, | (dxi/dt))| < « ¢€ > 0. Taking x? = a; and & = 0, we have 


the geodesics through (a) represented by 
(3.3) r; = ot, 0, a, (dx/dt),| (¢ = 1,---,n), 


with ¢; of class C*" for |t| < €and | (dz;/dt))| < ¢. Note that, since x; = a; 
is a solution of (3.1), from the uniqueness of the solution we have 


(3.4) ¢,(t, 0, a, 0) = a; (¢ = 1,---,n),|¢! << 


Now let us make a change of independent variable, f = ct, ¢ # 0, constant, 
replacing (3.1) by 


d° x; Jak dx, day 


_— d& * \if @ di 


= 0 (4=1,---,n). 


Since (3.5) is of the same form as (3.1), the solutions of (3.5) through (a) and 
with & = 0 are given by 


(3.6) 2; = ot, 0, a, (dx/db)o) 
= ¢,{ct, 0, a, (dxr/dt)o/c), ct| <.«, | (dx/dt)o/e. < «. 
Since (3.5) is equivalent to (3.1), we have, from (3.3) and (3.6), 
(3.7) oi{t, 0, a, (dx/dt)o| = o{et, 0, a, (dx/dt)o/c] (@§@ = 1,--+.n) 


: a, * : 12 
if every argument except the a’s is less than ¢€ in absolute value. 


a oe! ih fa ‘ ) " 
10 {ik = 9 7] = * a + = oe), if we use the customary notation. Cf. 


Eisenhart, p. 50, equation (17.8) and the proof following that equation. In the proof, in 
place of covariant derivatives with respect to z* multiplied by dzx*/ds and summed for k, 
absolute derivatives with respect to s should be used, as the functions differentiated are 
not functions of the z’s. Cf. our footnote 17. 

1 See G. A. Bliss, Solutions of differential equations of the first order as functions of their 
initial values, Annals of Math., (2), vol. 6 (1905), pp. 49-68 (theorem on p. 67); or Bolza, 
Variationsrechnung, 1909, p. 178. 

12 The work is carried this far in Duschek-Mayer, Lehrbuch der Differentialgeometrie, 
vol. 2, pp. 93-94, but it is not completed there. Cf. footnote 2 on page 95 there. We 
shall refer to this volume as Mayer. (Vol. 2 is by W. Mayer.) See also G. A. Bliss and 
Max Mason, Fields of extremals in space, Trans. Amer. Math. Soe., vol. 11 (1919), pp. 
325-340. 
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Now place the restriction |t| < » = ¢/2. For any sucht = 0, if c = ¢/2t, 
all the arguments in (3.7) except the a’s will be numerically less than ¢ provided 

(dx;/dt)o| < ¢«. Hence 
(3.8) oft, 0, a, (dx/dt)o] = die/2, 0, a, 2t(dx/dt)o/€] 

= V{t(dx,/dt)o on t(dzn/ dt)o| 

for 0 < \t| < n, | (dxi/dt))| <n. Whent = 0 the left-hand member of (3.8) 
is a,. The second member is also a;, by (3.4), when t = 0. Hence (3.8) 
holds also when t = 0, and thus the solutions of (3.1) are given by 
(3.9 y= Wilt(dx,/dt)o pe t(dx,,/dt)o] (a = 1,--- ’ n) 


with y; of class C*™ for | t| < , | (dx;/dt)o| < n. 
Now differentiate each side of (3.9) with respect to t and set = 0. Denoting 


by wm. °-** , y. the arguments of y; , we have 

(3.10) (dx;/dt)o = [ap(0) /dy ;|(dx ;/dt)o (¢ = l, , n). 
Since (3.10) are identities in the (dx;/dt)o’s, we infer 

(3.11) ay (0) /day; = 4;; (i,j = 1,---, nm). 


Now consider the transformation 


(3.12 4 = Vi(y “he » Bad (2 wf, ++ ’ n). 


From (3.11) we see that, for some ¢ < 9-7 = 7°, ¢ > 0, this transformation is 
one-to-one for | y;| < &. Since the y; are of class C*, the new fundamental 
form g;;dy:dy; will have coefficients of class C*~*; and since k — 2 = 2, the 
geodesics are uniquely determined in the new coérdinate system. 

Now consider the equations ’ 


(3.13) Yi = rt (i —= k, ie ,n), 


where \', +--+ , \” are (real) constants with | A‘! < » and 0 < \‘X’. The curve 
(3.13) is given in the (x)-system by 


(3.14) a; = wAr't, ---, A"d), \t| <9. 


Hence, comparing with (3.8), we see that it is the geodesic through (a) in direc- 
tion (\’, --- , 4"). We now see that the restriction | \‘ | < 7 can be dropped, 
provided that for each set (A) we restrict ¢ sufficiently. Thus all the geodesics 
through P are the curves given in the new coérdinate system by equations of 
the form (3.13) with 0 < \‘A‘. It follows that the new system is Riemannian, 
with origin at P, and the proof is complete. 


4. Hypersurfaces in a Riemannian space. Here we take the case of a surface 
of dimension one less than that of the space. 
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Derinition. The locus of a set of simultaneous equations 
o(x) = o(%1,°°+ tm) = O (@=1,--- ,k:k < m) 


is called a regular (m — k)-spread of class C’ neighboring a point (x), -++ , 2), 
in a Euclidean or Riemannian space with coédrdinates (x), if (2°) lies on the 
locus, the functions ¢; are of class C” near (2°), r = 1, and the matrix of the 
first partial derivatives is of rank k at (2°). 

TueoreM 4.1. If P is a point on a regular n-spread S, n > 1, of class C 
in a Riemannian (n + 1)-space R with positive definite fundamental form having 
coefficients of class C*, the normal curvatures" of curves on S through P, as functions 
of parameters determining their directions, have at most n critical values. When, 
as is in general the case, the number is n, the values are taken on in n mutually 
perpendicular directions; if the critical values are ordered according to their alge- 
braic values, then the i-th is at a critical point of index i — 1. 

Proof. Using Theorem 3.1, we introduce Riemannian normal codrdinates’” 
with origin at P in such a way that the surface x,,, = 0 is tangent at P to S, 
Then, if we denote x,,, = 2, S is given by 


(4.1) z= f(a oe . Baas f of class ©, 
with 
(4.2) See Pee (0, «++, 0). 

Ox, OX», 


Now let any unit vector (A) be given at P, so that 


(4.3) NA’ = 1. 
Let 

rv; = $(s) (gan J, +++ me), 
(4.4) 


z = filoi(s), --- . o.(s)] 
be a curve, with s the are length, s = 0 at P, such that 


13 For a curve on S through P with non-zero first curvature, the vector in the direetion 
of the principal normal and of length equal to the absolute value of the curvature will 
be called the curvature vector. If the curve has first curvature zero at P, the curvature 
vector is the vector with all components zero. The normal curvature of a curve through 
P on S is the projection on the normal direction to S at P, with a sense assigned to the 
latter, of the curvature vector. All regular curves of class C? on S tangent to a given 
curve on S at P have the same curvature vector. Cf. Eisenhart, p. 151. See also our 
Theorem 5.1 and the definition preceding it. For a curve whose curvature vector is 
orthogonal to S, the normal curvature is plus or minus the absolute value of the actual 
first curvature. 

4 This is well known. Cf. Eisenhart, p. 153. 

1 Riemannian normal coérdinates are Riemannian codrdinates such that g;; = 4,, at 
the origin (Eisenhart, p. 55). They are easily obtained from any Riemannian coérdinates, 
when the fundamental form is positive definite, by a linear transformation. The nota- 
tion in Mayer is slightly different. 
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(4.5) ¢;(0) = X' (i = 1,---,n)."* 
Since we have normal coérdinates, for curve (4.4) at P we have 


2 

6 dXa d dz._ d Ta ( | + 1) 17 

= cae OS tp cat OE eee ss ere 

és ds ds ds ds* : s 
Hence the curvature vector of the curve at P has its a-th component” d°r,/ds" 
(a = 1,+:+,n+1). Hence the normal curvature, projection of the curvature 

. . _— . . 2 ; 2 ~4 , 2 y 

vector on the direction of the positive z-axis, is d°2,.,/ds" = d°z/ds°. Now 


dz _ af dd; 
ds aa; ds’ 


and therefore at P the normal curvature 
Pe 2 ) . : as 
an = TF _ 780) as tes _ 90) 
ds’ ax;0x; ds ds Ox; OX; 
because of (4.2). Hence 


’ 


(4.7) Kumada, alee. 


Theorem 4.1 now follows from Theorem 2.1. 
We observe that, as a special ease, the (n + 1)-space may be Euclidean. 


5. Subspaces of a Riemannian space. In this section we take the case of an 
n-dimensional surface S in a Riemannian m-space, m > n, dropping the restric- 
tion that m = n+ 1. The idea, used here, of projecting the curvature vector 
on a direction orthogonal to S is found in Mayer,” used in another connection. 

Let a Riemannian m-space R be given, with positive definite fundamental 
form having coefficients of class C*, the variables being y,,---, ym. Let S 
be a regular n-spread of class C* neighboring a point P of R. Then S is itself 
a Riemannian n-space, whose fundamental form has coefficients of class C'.” 


'® For example, we could take the curve x; = AU (i = 1, ---,m), z = flr, ---, AM). 
Then at P the quantities dz,/dt (a = 1, --- ,m + 1) are (A', --- , A", 0). If now we change 
the parameter to arc length s, measured in the direction in which ¢ increases, the quantities 
dz,/ds are proportional to dz,/dt, and also have the sum of their squares equal to 1 at P; 
hence they are also (A', --- , \", 0). Hence dz;/ds = ¥' (i = 1, --- , m), as was to be proved. 

'7 The letter 6 indicates absolute (covariant) differentiation. Cf. Mayer, p. 31 ff. At 
the origin in Riemannian coérdinates, absolute first derivatives equal the usual first 
derivatives. Cf. Eisenhart, p. 56, for derivatives with respect to the space codrdinates. 
For derivatives to any parameter (e.g., as s above), the result follows immediately from 
the fact that at the origin the first partial derivatives of the coefficients of the fundamental 
form are all zero (Eisenhart, p. 55; Mayer, p. 117). 

18 Cf. Eisenhart, p. 61; Mayer, pp. 59-62. 

1% Pp. 159-160. 

2° If we want S to be sufficiently regular so that the geodesics on S exist and are unique, 
we can demand that S be of class C*. Its fundamental form will then have coefficients 
of class €?. 
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We begin by establishing a property of curvature, following the next defi- 
nition. 

Derinition. The projection of the curvature vector of a curve in a direction 
normal to the curve at a point on it will be called the normal curvature of the 
curve for the given direction. 

THEorEM 5.1. All regular curves of class C’ on S tangent to a given curve of S 
at a fixed point P have the same normal curvature for any direction normal to S. 

Proof. Introduce Riemannian normal coérdinates in R so that the locus 
0 = Yns1 = *** = Ym becomes tangent to S at the origin P. As the new 
coordinates are obtained by a transformation using functions of class C°, the 
locus S is now given by 


(5.1) y=hly,-*+,y), feofclassC, (k=n+1,---,m), 
where all the first partial derivatives are zero at (0,--- ,0). Let 
yi = pi(s) (i = 1,---,n), 
(5.2) 
Ye = Sul~—r(s), +++ , pn(s)] (k= n+ 1,--+,m) 


be one of the curves in question, with s the are length. It is easily verified that 
the p’s must be of class C’. 

Since we have normal codrdinates, as in $4 the curvature vector has a-th 
component d’yq/ds’ (a = 1,-++,m). Hence its projection on the direction 
(0,--: 0,1) is d’y,,/ds’. Now 


dYm — S Ofm dpi 


ds i=1 OY; ds 
and therefore, at P, 


(5.3) UYm _ SO Sm api dp; 


ds? St, ay;dy; ds ds’ 
since daf,./dy; = Oat P. As this answer depends only on the direction of the 
curve (5.2), and since any direction normal to S can be made the direction 
(0, --+ , O, 1), we infer the validity of the theorem. 

We now return to the principal question. 

Tureorem 5.2. Let P be a point on a regular n-spread S of class C* in a Rieman- 
nian m-space, 1 <n <‘m, with positive definite fundamental form having coeffi- 
cients of class C*. If normal curvatures are taken at P for any given normal 
direction to S at P, the conclusion of Theorem 4.1 holds. 

Proof. Choose coérdinates as above, making the given direction the direction 
(0, --- , 0,1). Since (5.3) is the same kind of result as (4.6), the brief proof 
is the same as that following (4.6). Hence the theorem is true. 

Remark 1. All regular curves on S tangent to a given curve on S and with 
curvature vectors orthogonal to S or having all components zero have the 
same curvature vector. 
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This is a known property,” which we state for convenience in reference. It 
also follows easily from Theorem 5.1.” 

Remark 2. If, under the hypotheses of Theorem 5.2, S being supposed 
of class C’, all the geodesics of S through P have their curvature vectors 
confined to two opposite directions when non-null, then, in Theorem 5.2, the 
normal curvatures, for either of those directions, of all curves whose curvature 
vectors are orthogonal to S are plus or minus the absolute values of the 
curvatures. 

This follows from Remark 1 and the fact that when a geodesic has a non-null 
curvature vector, the latter is orthogonal to S.” 


6. The Ricci mean curvature. ‘The sum of the » — | Riemannian curvatures 
determined by a direction (A’,--- , \") in a Riemannian n-space R and each 
of n — 1 other vectors such that all n are mutually orthogonal is called the 
mean curvature, p, of the space for the given direction, and is given by 


_ Riv 


6.1 = ; 
( ) p gig’ ? 


where R;; and g;; are components of the Ricci tensor and the fundamental 
tensor respectively.“ If, as in §4, we introduce Riemannian normal coérdinates 
with origin at a given point P where we are considering (6.1), and take (A) 
as a unit vector, at P (6.1) reduces to 


(6.2) p = —R,,\'W. 


We can now apply Theorem 2.1. This gives us the following 

TuHeoreM 6.1. If P is a point in a Riemannian n-space with positive definite 
fundamental form having coefficients of class C*, the mean curvatures at P, as 
functions of parameters determining the direction, satisfy the conclusion of 
Theorem 4.1. 


Appendix — The critical diameters of central quadrics” 


7. Central quadrics. The locus S of a second degree equation in Euclidean 
(x1, °*** ,%)-space will be called a central quadric if it is not vacuous and is 
symmetric in a point P not on it, called a center. For example, if n = 2 the 
central quadrics (now conics) are of the following types: circle, ellipse, hyper- 
bola, pair of parallel straight lines. In general a quadric surface is symmetric 
and has only one center. In any case, a particular center will be chosen and 
called the center. 


1 Cf. Mayer, pp. 158-159. 

22 By Theorem 5.1, the projections on the yn.:, ---, Ym directions are the same for all 
the curves, and as the projections on the y,, --- , yn directions are zero, the vectors are 
all the same. 

°3 Kisenhart, p. 75; Mayer, p. 159. 

4 Cf. Eisenhart, p. 113. 

* This part was originally presented to the Society under its own title. 








482 ARTHUR B. BROWN 


In the earlier part of this paper we had occasion to consider the value of an 
arbitrary real quadratic form a;;2;2; on the locus z;2; = 1. This suggests con- 
sidering the value of x;2; on the locus a;;2;2; = 1. Our principal result is that 
if for a central quadric in n-space the lengths of the diameters, as functions of 
parameters determining the direction, have critical points in only a finite 
number of directions, the 7-th in magnitude is at a non-degenerate critical 
point of index 7 — 1. 


8. Preliminaries. If the center is at the origin, 2,2; is one-fourth the square 
of the diameter with end points (2) and (—z). As we prefer to consider the 
diameter itself, we begin with the following lemma. 

Lemma. Suppose f(x: ,-++ ,2n) is of class C’ neighboring (x°) in real n-space, 
g(x) = [f(x), and f(x’) > 0. Then if either f or g has a non-degenerate critical 
point at (x°), the other has one of the same index. 

Proof. Since dg/ax; = (2f)(af/ax,), if either has a critical point so has the 


other. Differentiating again and setting 2; = x! (¢ = 1,--- ,n) we have 
2 U 2 0 
( . 
a g(x) = 2f(2") a f(a ) 
Ox; OX; Ox; OX; 


since af(x”)/ax; = 0. The conclusion now follows easily from the definition of 
. OY > ‘i 
index.” We shall apply the lemma with n replaced by n — 1. 

Now let us consider the principal question. Suppose a central quadric S is 
given in n-space, n > 1, and we make a translation of axes so that the center 
(or a center, which will be designated hereafter as the center) becomes the 
origin. The new equation has no first degree terms and as in §2 we can make 

. ° a ae ° 2 . 
an orthogonal transformation,” giving us the new equation c;7; = d. Since 
the origin is not on S, d # 0 and can be made unity. We can easily arrange 
that S has equation 
(8.1) xi tees + aeae — Oe tin — te — jb = 1 
with the a’s all positive and the 6’s positive or zero. Here k 2 1 since S is not 
vacuous. Note that ¢ = a) .°°+ .¢e = Qh, Cea = —dea. +++ tn = —D,. 

We now seek the critical points of 
(8.2) 2 = 2&2; 
on the locus (8.1), as a function of m — | parameters. A line through any 
point on (8.1) directed towards the origin is easily seen not to be orthogonal 


to (8.1). Using this fact we easily prove that, as in a similar situation in §2, 
it is sufficient to find the critical points located on (8.1) of the function 


(8.3) w = (x#,;x,)/(e:27) 


26 Tf [d*f(x°)dx;dx;\-xe;rj = —yi — ++ — YEH vias Hes + yk under a non-singular 
linear transformation, the index is k. 

27 The determinant of the coefficients can be made +1 if we like, so that the transforma- 
tion is a “rigid motion’’. 
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of n independent variables. Upon differentiation we find that they are the 
points satisfying (8.1) and 


(8.4) 2, = (x, 7,)c. 2. (s = 1, +--+, n; do not sum s). 


Since 2;x; # 0 on (8.1), at any critical point at least one x, # 0, hence by 
(8.2) and (8.4) z = l/c... Since ces: , +++ , Ca are not positive, the number of 
distinct. critical values of z is seen to be at most k, hence finite. 


9. The theorem. A diameter in a direction where the length has a critical 
point will be called a critical diameter. 

Turorem. I[f a central quadric in n-space, n > 1, has only a finite number of 
critical diameters, they are mutually orthogonal and the i-th in length is at a non- 
degencrate critical point of index i — 1. 

Proof. Suppose, in (8.1), a; = a2. Then any point (x , 22, 0,--- , 0) such 
that 2} + 23 = 1/a; = 1/a, = 1/e, = 1/c2 would satisfy (8.1) and (8.4), hence 
determine a critical diameter. The number of critical diameters would then 
be infinite, contrary to hypothesis. We infer that the a’s are all distinct. 

In §8 we saw that if x, ¥ 0 at a critical point, z = 1/c, at the point. Hence 
s S k, and since ¢, , «++ , c; are all distinct, at most one z is different from zero. 
Hence the critical diameters are mutually orthogonal. Since the 2z’s cannot 
be all zero, we infer that there are just k critical diameters, one along each of 
the first / codrdinate axes. 

Consider the solutions of (8.1), (8.4) with x, # 0. They are 


(+1/(a;)', 0, «++ , 0). 


We choose either sign. Neighboring the point chosen, x2, +--+ , 2, can be taken 
as independent variables for (8.1). Substituting from (8.1) into (8.2) we obtain 


= I + (: — “20 fe ose (: oak 5a + (: + betta 
ay ay ay ay 
b,, 3 
tee t(i4 et. 
ay 


Since none of the coefficients is zero, the critical point is non-degenerate and of 
index equal to the number of a’s larger than a, , hence to the number of (1/a)’s 
smaller than (1/a,). The critical value is 1/a,;. Since a similar result is 
obtained with x; replaced by each of xe, +--+ , 2, we infer the truth of the 
theorem. 

Remark. I[f S is a central quadric not a surface of revolution, the hypotheses 
of the theorem are satisfied. For we have seen that the distinctness of the a’s 
implies that the critical diameters are mutually orthogonal, hence finite in 
number. 

On the other hand, a surface of revolution may satisfy the hypotheses of the 
theorem, for example, any surface for which the a’s are distinet but two of the 
b’s are equal. 


CoLumBia UNIVERSITY. 








THE CHARACTERISTIC ROOTS OF A MATRIX 


By W. V. ParKER 


1. Introduction. If A is a square matrix of order n and J is the unit matrix, 
the equation obtained by equating to zero the determinant | A — XJ | is called 
the characteristic equation of A. The roots of this equation are called the 
characteristic roots of A. 

If A is a matrix of a particular type, certain definite statements may be made 
concerning the nature of its characteristic roots. For example, if A is Hermitian 
its characteristic roots are all real. While it is not possible to make any definite 
statement about the nature of the characteristic roots for the general matrix, 
several authors have given upper limits to the roots. The first of these limits 
seems to have been given by Bendixson' in 1900. He obtained upper limits 
for the real and imaginary parts of the characteristic roots of a real matrix. 
In a letter to Bendixson in 1902, Hirsch’ extended these results to include the 
case when the elements of A may be complex numbers. Hirsch obtained an 
upper limit for the characteristic roots as well as for their real and imaginary 
parts. A limit was also given by Bromwich’ in 1904. In 1930, Browne‘ ob- 
tained limits which do not exceed those previously found and are in general less. 

In the present note it is shown that the limit for the characteristic roots can 
generally be determined to be less than the one given by Browne. A lower 
limit for the characteristic root of greatest absolute value and an upper limit 
for the characteristic root of least absolute value for an Hermitian matrix are 
also found. 

Let A’ and A denote the transpose and conjugate, respectively, of the square 
matrix A and write 


B= 


It is evident that B and C are Hermitian; that is, B = B’ and C = C’. A 
theorem given by Browne may be stated as follows: 

Browne’s Tororem. If R;, R; , and R’ are the sums of the absolute values 
of the elements in the i-th row of the matrices A, B, and C, respectively, and if T; 


Received March 10, 1937. 

1 Bendixson, Sur les racines d’une équation fondamentale, Acta Mathematica, vol. 25 
(1902), pp. 359-365. 

? Hirsch, Acta Mathematica, vol. 25 (1902), pp. 367-370. 

3 Bromwich, On the roots of the characteristic equation of a linear substitution, Acta 
Mathematica, vol. 30 (1906), pp. 295-304. 

4 Browne, The characteristic roots of a madrizx, Bulletin of the American Mathematical 
Society, vol. 36 (1930), pp. 705-710. 
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is the sum of the absolute values of the elements in the i-th column of A, and if 
R, R', R", and T are the greatest of the R; , R; , R!, and 1; , respectively, then for 
any characteristic root, \ = a + iB, of A we have 

< R+ 17 


a ee 


s R’, B| = R". 


2. An upper limit to the characteristic roots of A. If \ = a + 78 is a char- 


acteristic root of a matrix A = (a,.) of order n, there exists a set of numbers 
(a, Ye, °** , 2%) such that > r,&, = 1, which satisfy the relations 
r=1 
(1) AZ, = Dd an 2, (r = 1,2,---,m). 
s=1 


If we multiply the r-th equation in (1) by #, and sum as to r, we get 


(2) = - Rte ts 


ra=l 


If in (2) we take conjugates on both sides and interchange the subscripts, we get 
(3) K= Do a,x. 

ra=l 
From (2) and (3) by addition and subtraction we get 


(4) a= > Retr Rss 
r3=l 


(5) B= Dd exit. 
ral 


From the relations (2), (4) and (5) we determine upper limits for | \ |, | @ | and 
8 |. Since these relations are identical in form, it is sufficient to carry the 
computation through for one of them. We shall write 


R, = > Ja,|, T, = D \a,|, 28, = R, + T,, 


s=l r=1 


and let & = | 2, so that p = landé,g, < (6 + &). If we take absolute 
ran! 


values in (2), we get 


where S is the greatest of the S,. Similar inequalities hold for @, and | 8 
and we have the following theorem. 
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Turorem 1. Jf A is any square matrix, and if 2S,, 2S; , 2S) are the sums of 
the absolute values of the elements in the r-th row and the absolute values of the 
elements in the r-th column of A, B, and C, respectively, and if S, S’, S” are the 
greatest of the S,, S;, S.’, respectively, then for any characteristic root, \ = « + iB, 
of A we have 


> 
WA 
x. 
- 
A 
nH 
— 
A 
‘A 


The latter two limits are those given by Browne but the limit for | \ | is gen- 
erally less than his. 


3. The characteristic roots of AA’. Browne’ has shown that if \ is a charac- 
teristic root of a square matrix A and if G@ is the greatest and s the smallest of 
the (real and not negative) characteristic roots of AA’ then 0 < s < Mi S G. 

Let U, be a square matrix of order n whose elements are determined by the 
following conditions. The elements of the r-th column (ra definite number) are 


. 4+. —} = -} 
UyO, , Ad, ,*** » Amn, , 
where 


(6) o, = >) Gndn > 0° 


i=] 


The elements of the j-th column (j7 # r) are 


fa, tie, peta as » Lins 
where 
Dank =0 (j #4), 
t=] 
(7) n 
> Tut = 6; (1,7 #r). 
t=1 
The matrix U’, thus determined is unitary, that is, UU, = I. If P, = AU,, 


the elements of the r-th row of P, are 


Pri = D> an Fit => 0 (j x r), 


t=1 
Pr a IP O44), ‘= o>. 
t=] 
. . a 1 . . . . 
It is evident, therefore, that o2 is a characteristic root of the matrix P, = Al’,. 


5 Browne, The characteristic equation of a matriz, Bulletin of the American Mathematical 
Society, vol. 34 (1928), pp. 363-368. 

* It is assumed here that not all elements of the r-th row of A are zero. If all elements 
of any one row are zero, then zero is a characteristic root of A and the theorems as stated 
here are still true. 
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In a similar way we may construct a unitary matrix V, such that Q, = V,A 
. . 1 
has the characteristic root 7} where 


(8) % = i Ot. Ats . 
By taking products we see that 
P,P, = AU,U,A’ = Ad’, 
0.9. = A'VLV.A = A’A. 
Hence the characteristic roots of P,P’ and Q/Q, are identical with those of AA’. 


These roots are real and not negative. If G is the greatest and s the smallest 
of these roots, we have, as shown by Browne 


s So, Gands Ts 


lA 


WA 
a 


Hence we have the following theorem. ‘ 

TuHeorEeM 2. If o,(7,) is the sum of the squares of the absolute values of the 
elements of the r-th row (column) of a square matrix A, and if o (1) is the greatest 
and o'(r’) the smallest of the o,(r7,), then AA’ has a real positive (or zero) charac- 
teristic root not less than the greater of « and +r and a real positive (or zero) char- 
acteristic root not greater than the smaller of o' and r’. 

In particular, if A is Hermitian, its characteristic roots are all real and the 
characteristic roots of AA’ are the squares of the characteristic roots of A. If 
the characteristic roots of AA’ are Aj >AZ S--- = ll we have 


” ” 
MZe2edc Zi., 


and hence 


Therefore we have the following theorem. 

THEOREM 3. If o, is the sum of the squares of the absolute values of the elements 
of the r-th row of a Hermitian matrix A, and if o is the greatest and a’ the least 
of the o,, then A has at least one characteristic root whose absolute value does not 


} a , 

exceed (o')’ and at least one characteristic root whose absolute value is not less 
i 

than a’. 


4. A lower limit for the characteristic roots of A. A matrix has the character- 
istic root zero if and only if it is singular. It is well known that if A is a 
characteristic root of a non-singular matrix A, then 1/A is a characteristic root 
of A-’, the inverse of A. If |1/A} S L, then || = 1/L and hence an upper 
limit for the characteristic roots of A~' determines a lower limit for the char- 
acteristic roots of A. 


LoursiANA Stave UNIVERSITY. 








COMPLETELY MONOTONE FUNCTIONS OF THE LAPLACE 
OPERATOR FOR TORUS AND SPHERE 


By S. BocHNER 


In the present note we shall discuss some properties of the Laplace operator 


| a FY 
(1) Ag = — bs ( +++ + »)a 


ari 
on the torus 
(2) -$32.<} (x = 1,--- ,k), 


that is, for functions having the period 1 in each variable, and corresponding 
properties for the Laplace-Beltrami operator on the sphere of positive constant 
curvature. 

PartI. The torus 


1. We introduce on the torus (2) the Hilbert space of functions of integrable 
square. It is the family of functions 


(3) f(z) ~ DY +++ DS aay. .ng exp [2ri(mizy +--+ + ner)) 
for which 


(4) PE of aes 


At the outset, the operator (1) is defined only for functions which are differen- 
tiable twice. As such it is a positive semi-definite Hermitian operator with 
characteristic functions 


(5) exp [2ri(maz, + ++: + meay)] 
belonging to the characteristic values 
(6) nites: + ni. 


In accordance with a general theorem referring to the nature of the Laplace- 
Beltrami operator on a compact Riemann space, our initial operator has a 
unique self-adjoint (hyper-maximal) closure with the same spectrum.’ In what 
follows we shall be interested in this closure only and we shall denote it by Ag. 
The spectrum of Ag is non-negative. Let ¢(p) be an arbitrary real continuous 


Received May 4, 1937. 
'Cf. S. Bochner, Analytic mapping of compact Riemann spaces into Euclidean space, 
this Journal, vol. 3 (1937), pp. 339-354. 
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function in the half-line 0 S p < «. According to a general theorem on 
functions of operators’ we can form the operator g(A)g. It is again self-adjoint 
and, in our case, it is uniquely determined by the fact that it again has the 
characteristic functions (5), the corresponding characteristic values being 


e(ni +--+ + ni). 
If y(A)f is defined for an element (3), then its value is 


(7) > g(ni + ee) + Ni)Gn,---np exp [2r7(mai + --* + m2,)], 


and it is defined for an element (3) if and only if (7) is again an element of the 
Hilbert space, that is, if 


D lott to $n) Playa <=, 
For example, if 
” g(p) = 


then (7) assumes the form 


= c? c > 0, 


ol 
ny ---mp 


“Smif rrs M+ 
and in this case the operator g(A)f is the inverse of the operator (A + c)g; 
that is, it represents the solution of the equation 


(9) (A+ c)g =f. 


Purely formally the expression (7) can be written as an integral operator. 
Introduce the Green’s function 





- exp [2ri(miay + +++ + nex), 
c 


(10) G(r) = D olni + «++ + nd) exp [2ri(ma + ++: + max], 


then 
(11) (ayy = | vo f Gla, — Bay 20 te — BIEL, + Bb 


Our aim is to discuss a class of functions ¢(p) for which this representation 
is valid. 


2. We assume that ¢(p) is completely monotone in 0 S p < ~. This means 
that ¢(p) has derivatives of all orders in 0 < p < ~ and that 


(—1)" 9) > 9 (n = 0,1,2, -*:). 
dp” 


2M. H. Stone, Linear Operations in Hilbert Space, 1932, Chapters VI, VII 
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° * ° ye 3 ° ° ° 
By an important theorem of 8. Bernstein and Widder’ any function which is 
completely monotone in a half-line p < p < © can be represented uniquely 
as a Laplace integral 


(12) ¢(p) -[ e * dealt) 


in which the function a(t) is monotonely non-decreasing. Since, by our assump- 
tion, ¢(p) is bounded to the right of p = 0, we have 


(13) [ da(t) < ~; 
0 
this assumption will be somewhat relaxed later on. Another restriction which 
will be required throughout is the condition 
(14) a(+ 0) = a(0); 


thus the function a(t) shall have no discontinuity for t = 0. The rdéle of this 
restriction is obvious; it excludes the function g(p) = 1 for which no Green’s 
function exists. As a consequence of this restriction we have the limit relation 


(15) g(p) = lim ¢a(p), 
where 
(16) Yalp) = | e * datt), a> 0. 


Obviously 
(17) gulp) = Ole”), p> %, 
Postponing questions of convergence, we form the integral 


(18) Hla, +++, t%) 
xz 
= | tae | etni foes + nj) exp [2m7(miai + ++ + newed|dm +--+ dre, 
x 
and we observe that, by Poisson’s summation formula,’ it is connected with 
the function (10) by the relation 


(19) G(ay, +++, t%) = . tee > (a, + mi, -**, Te + me). 


Substituting in (18) 


g(nt +--+ +n) = | exp [—(ni + --- + n>Dtlda(t), 


0 


’D. V. Widder, Necessary and sufficient conditions for the representation of a function as a 
Laplace integral, Trans. Amer. Math. Soc., vol. 33 (1931), pp. 851-892. 
‘Cf. 8. Bochner, Vorlesungen tiber Fouriersche Integrale, 1932, pp. 33-38, 203-205 
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inverting the integrations and making use of the formula 


I exp [—(n®t — 2xinzx)|dn = (xt) exp (- =) 


a 4 


we obtain 
(20) Hla, ---,%) =7 Z| exp | - wlat-: + |. * dat). 


Hence, introducing the function of one variable 


(21) H(r) =r af | exp | = re * det), 


we have 
H(a,, +--+ , 22) = Hl(xi +--+ + 23). 


The function H(r) which we consider for 0 < r < ©& is positive and non- 
increasing, by (20), and the function G(r) is positive, by (19). 

In order to justify these relations we first replace the function ¢(p) by the 
truncated function (16) and we denote the corresponding functions (10), (18), 
(21) by G(x), H.(x), H.(r), and the corresponding operator ¢(A)f by ¢a(A)f. 
Because of (17), the expressions (10), (18), (21) converge absolutely, relation (19) 
is valid, and, the function G,(x) being bounded, the right side of (11) has a 
meaning, and its value is (7), for any square-integrable function f(z). Thus 
relation (11) holds. 

We now assume that @ tends decreasingly to 0. Initially, the integral (18) 
and the sum (10) have no meaning for the function ¢(p) itself, but we define 
them by the limit-relations 


(22) H(x) = lim H,(2), G(x) = lim G,(2). 
a0 a0 
Since the functions H,(x), G.(x) increase when a decreases, the limit relations 


have a meaning (although the resulting functions might have values +), 
and relation (19) holds. We shall now use relation (15). In the first place it 
justifies relation (20), thus proving that //(r) is everywhere finite. But we have 
still to show the finiteness of (10) and the validity of (11). As long as (18) is 
absolutely convergent it admits the inversion formula, 


(23) g(ni +++ ni) 


< 


= | oes | H(a,, +++, a%) exp [—2ri(aim: + «++ + rene) |day +++ day. 


x 


In particular, 
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2 


¢a(0) a | ee | H,(x, eae ry) dx -++ dx. 
Since H,(x, «+: , 2%) and gq(0) tend increasingly to their limit-values, we obtain 
¢(0) = / ee | H(x,, +++, a) da, +++ da,.” 


Thus, the integral on the right is bounded. Since H(r) decreases as r increases, 
we readily conclude that the sum (19), but for the term H(x,, ++: , 2), con- 
verges uniformly in (2). Thus, in (2), G(a, --+ , rx) differs from H(a,, +--+ , 2x) 
by a bounded function only and, in particular, is finite. Since, by (10), 


1 
+3 


ea(0) -| vf Gilby -+ 80 dé ++ ds, 
c=} 


the integral on the right is finite also for the limit-function G(é,,--- , &). It 
is now easy to give a meaning to the integral (11) and to show that it has the 
value (7). As a consequence of (14) the function ¢(p) tends to 0 as p—> ~ and 
the value (7) of ¢(A)f shows that all our functions g(A) of A are completely 
continuous operators. 


3. We shall next discuss the order of infinity of G(x, --- , 2.) in the neigh- 
borhood of the origin in terms of the magnitude of ¢g(p) as p-—> ~. In the 
statements the function G(x , «++ , 2%) will be replaced by the function H(r). 


I. In order that H(r) be bounded (in the neighborhood of r = 0) it is necessary 
and sufficient that for some (and therefore every) a > 0 


(24) I g(p)p "dp < », 


In fact, all occurring functions being non-negative, we may conclude 


[ o(p)p ‘dp -[ [ aes p dp da(t) = (8) [ t *da(t) 
0 0 0 0 


= r(§) r-lim H(r). 
2 r—0 
To simplify writing we shall put 


K(r) = rd rt da(t) = en(e), 
0 us 


Similar to (25) we obtain, for 


(25) 


0<) < $k, 


5 See also S. Bochner, Monotone Funktionen, Stieltjesche Integrale und harmonische 
Analyse, Math. Annalen, vol. 108 (1933), pp. 399-408. 
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putting « = 3k — X, the relation 


l ™ A--1 l “i a —1 
ip = . Ir. 
05 L o(p)p dp ra K(r)r dh 
Hence, we have 


II. In order that for some (and therefore every) a > 0 
(26) i Hr? ""dr < @, 
0 
it 1s necessary and sufficient that for some (and therefore every) b > 0 


(27) [ o(p)p ‘dp < ~. 


Finally, we have 
Ill. If L(é) is any function in 0 < — < & such that, for every c > 0, 


(ct 
(28) lim a = 1 as §-0 and E>, 
then the asymptotic relation 
A 1 
(29) ¢g(p) ~ — L (4), p— @, 
p p 


implies an asymptotic relation 
(30) H(r) ~ 3, Le’) 0 
e pe-r 4\ ’ ! ? 


and vice versa. 
This follows from the following theorem.” 
If T(t) is non-decreasing, T(+0) = T(0), and [ e *' dT(t) is finite for s > 0, 
0 
then, for ¢ > 0, the relations 


(31) [ c“dT) ~s “1(*) vehediles 
0 8 ass-— &® 

and 

(32) mw. “= 


(eo + 1) asx— 0 
are equivalent. 
We shall verify, for instance, that (29) implies (30) or, what is the same, 
that (29) implies 


ee 
K(r) ~ — L(y), r— 0. 
r# 
*J. Karamata, Neucr Beweis und Verallgemeinerung der Tauberschen Sdtze, welche die 


Laplacesche und Sfielijesche Transformation betreffen, Crelle’s Journal, vol. 146 (1931), 
pp. 27-39. 
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By assumption 


| e “‘da(t) ~ . L (‘), s—> w, 
0 s s 


Consequently, by the quoted theorem, 
At‘ Lit) 
ra + 1)’ 


1 A l 1 


Defining the function 7'(¢) by the relation 


dT(t) = Hal -o( +) | 
A x, (i - 


K(s) -[ et *dalt) = [ e “dT, 


and therefore, since (32) implies (31), we obtain, replacing L(x) by L(x’), 


a(t) at t— 0, 


or 


we easily obtain 





But 


oy . ain 2 
K(s) ~ ——— -— L(s), s—0. 
‘s8) ra) (s) s—_ 
4. Finally we shall show that G(a,,--- , 2%) is analytic at all points of the 
torus except the origin and that for an analytic function f(a, --- , 2%) the 
transformed function ¢(A)f is again analytic. 
If 2}, --- , 2, is any fixed point on the torus different from (0, --- , 0), then 


the function H(xr) is analytic in some neighborhood of this point. In fact, 
given an e > 0 there exists a 6 > O such that 

am +e + laf elal+-- +i as 
for all complex values of the neighborhood 


02 


m1 — 1 | +--+ +la—al <e. 


Consequently, the integral (20) and the partial derivatives 


(33) wad =o [ (-"" *) exp (—* (xi pees + a) * de(t) 
OX « 0 t t 


exist in a complex neighborhood, thus proving the analyticity of H(a,, --- , 2%) 
at all points but the origin. Also, the argument in §2 shows easily that the 
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series (19) converges uniformly in our complex neighborhood; the result is that 


G(a, .--* , %«) is analytic at all points of the torus except the origin and that 
in the neighborhood of the origin it differs from the function H(2,, --- , x%) 
by an analytic function. Furthermore, in dealing with the singularity of 
H(x,.--- , x,) at the origin we may assume that ¢(p) has the form 
(34) g*(p) = | e dat). 

0 


This follows from the fact that the difference function ga(p) = ¢(p) — ¢*(p) 
(see (16)) has the order of magnitude 


ga(ni + --- + ni) =O (exp [— 44(ni + --- + nj)}) 


for a > O sufficiently large, thus making the corresponding function G,(zx) 
analytic by crude absolute and uniform convergence of series (10). 

We now assume that f(r, +--+ , 2¢) is analytic on the torus. If x runs over 
x complex neighborhood of a fixed point and & over a set S of the torus such 
that @(a — &) is analytic in the neighborhood of the fixed point, uniformly in & 
from S, then the integral 


J = | Ge OF @de--- ag 
is analytic at the given point z (even if f(€) is not analytic). Hence it is suffi- 
cient to show that 


(35) g(x) = f+ | Hie O40) de ++ dk 
is analytic at the origin if S, is some sphere tiit-::: + & <r’ and 


H(r) = [ exp | fa fiver yb #8) | dato, 


a > 0, ¢ > 0. Following E. E. Levi’ we replace in (35) the codrdinates 
f,--: , & by a type of polar coérdinates originating at the variable point z 
which is interior to S,. The new codrdinates consist of angular variables 
6,,°*: , & 4 and a radial distance p,0 <p <1. The angular variables do not 
depend on the variable point z. They are fixed variables on the surface of the 
sphere S,, such that the rectangular coérdinates m ,--- , % of the surface 
nit::: + m = Pr are fixed functions of 6,,--- ,0.:. But the radial dis- 
tance p does depend on z and is defined in the following way. Let » be any 
point on S, ; if & lies on the segment joining x and n, then 


& = 1% + e(m — 2.) (x = 1,--- ,b), 


7 Cf. E. Hopf, Uber den funktionalen, insbesondere den analytischen Charaktler der Lésun- 
gen elliptischer Differentialgleichungen zweiter Ordnung, Math. Zeitschrift, vol. 34 (1931), 


p. 224. 
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the quantity p being the length of the segment (z, &) divided by the length of 
the segment (x, ). If we introduce these coérdinates in (35), then the factor 
of H(x — &) is analytic in a complex neighborhood of x = 0, uniformly in the 
variables @,, +--+, 0:1, p, over which we integrate. Also, what is very impor- 
tant, the volume element contains the factor p "dp. As for H(x — &) itself, 
we can write it in the form 


(36) H(x — &) = [ exp | - (a1 —m)? +--+ - we * det) 


Since ni feee + ni = r, there exists a fixed complex neighborhood N of the 
point r = 0, such that in this neighborhood 


IV 


(a1 — m) + +++ + (te — md? 


Hence (36) is dominated by 


| exp |-% fs * det). 


Therefore (36) is analytic for x C N, uniformly in all 6. -+> , 1. and in 
e < p <7, for any fixed « > 0. Hence the function 


ger) = [+++ [| He — 9408) dé +++ a 


S;—S, 


is analytic for cr C N. But g(x) — g(x) is dominated by 


I [oom] - Feo toda, 


and this is a finite dominant independent of x. The function g(x), being the 
limit of boundedly convergent functions over a complex domain, is likewise 
analytic. 


5. The function (2, , +++ , 2,) entering the sum (19) is a Green’s function 


itself, namely, the Green’s function of the operator g(A)f not for the torus (2) 
but for the whole Euclidean space; compare its definition (18). For the special 


function 
. p+e 0 


we have 


H(r) = x" | exp | -= — ct}. * dt, 


H(r) = 2n** (re™)' * Ky, (2nre’), 


and therefore 








eee 
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where K,(z) is the Hankel function of imaginary argument.” For k 2 3, its 
expansion in the neighborhood of the origin, apart from a factor independent 
of r and ¢, starts with the term 

° 


(37) =? 
The other terms contain each a positive power of ¢ or a product of a positive 
power of ¢ with log c. Thus, for ¢c — 0, we obtain the term (37) alone, and 
this is actually the Green’s function for the operator Af corresponding to 
v(e) = p'. But the corresponding sum (19) tends to + as ¢ — 0, and so 
does formally the series (10) since its constant term ¢(0) tends to +, if 
¢(p) + p. The reason is that on the torus the operator A”'f does not exist 
for all functions of integrable square, since Af has the proper function f(z) = 1 
corresponding to the characteristic value 0. But the operator A7’f exists for 
all funetions which are orthogonal to this proper function, that is, for all func- 
tions f(x) in whose expansion (3) the constant term vanishes. More generally, 
for this restricted class of functions f(x) the operator g(A)f will exist if ¢(p) is 
(bounded and) completely monotone in 1 S p < «. It will again be repre- 
sentable in the form (11) if in the sum (10) we omit the term corresponding to 
ny = ++: = n, = 0. Suppose in general that ¢(p) is completely monotone 
for x = b, in which case G(x) is defined as the sum 


b vee ~ o(nt + +++ + nj) exp [Qri(miay + ++ + ead). 


2 2 
nyt ttnpeb? 


If a is any fixed number 2 b, we put 


| e da(t) + [ e * dalt) 


gilp) + gal(p), 


¢(p) 


and 


Gy(x) 


Dees > odlat + <-> + af) exp [2ri(mai + +++ + ne re)], 


Gx) = ~h. see p> gi(ni 4 +++ + nj) exp [2ri(miai + +++ + me ax)] 


Sgt* 4 njp<a? 


+ > +++ 2 alnl + --- + af) exp [2ri(miai + +++ + mg ay)]. 


nytorstnpra? 
Obviously all previous considerations remain valid for the pair of functions 
vi(p), G(x). Also go(p) = Ofe"") as p > «, and therefore G2(r) is analytic 
and bounded on the torus. Consequently all previously stated properties 
remain in foree for the functions ¢(p), G(r) themselves. 


®*G. N. Watson, Bessel Functions, pp. 185-189. 
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Part II. The sphere 


We shall now consider the Laplace-Beltrami operator on a k-dimensional 
space of constant positive curvature, and we shall assume that our space is 
given as a Euclidean sphere of radius 1, 


(38) H+ --- +8 = 1. 


It is possible to prove analogues of our previous theorems for the most general 
completely monotone functions ¢g(p) as before. But the formulas are more 
complicated and the argument is rather tedious. However, there are special 
classes of completely monotone functions for which special sets of formulas are 
available, and we shall restrict our attention to one special class of this kind 


6. We shall need a lemma on completely monotone functions. 
If Up) is completely monotone in p = 0, 


(39) He) = i o* dalt), 


and x(p) is the integral, vanishing at the origin, of a completely monotone function 
inp = 0, 


x'(p) = I e dy), 
0 
then the function ¢(p) = W(x(p)) is again completely monotone in p 2 0. 

The lemma can be proved directly from the definition by verifying inductively 
that the n-th derivative of ¢(p) has the sign of (—1)". However this procedure 
is rather cumbersome. A more elaborate but more illuminating proof runs as 
follows. We first conclude from the definition and the integral representation 
that sum, product and limit of completely monotone functions are again com- 
pletely monotone. Therefore, since ¢(p) is a limit of finite sums of the form 

yi e adie P| 1) — alt, )), 


x\e 


it is sufficient to prove that if x(p)t, is replaced by x(p), ¢ is completely 


monotone. Approximating to x’(p) by a finite sum 


> 6" (v(tnai) — y(tn)) 


we may further assume that y’(p) = e °, in which case 


(p) = exp bitline? = ¢ exp | © ‘ 
y(p X] ; = X] rat 


But the latter function is completely monotone, since 


2 pt 2 
exp ¥ al=> = e * dB(u). 
t nao n! e* 0 
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In particular, if » is any constant > 0, we may put 
x(p) = (p+ ¥*)h — >» 
since 
2x'(p) = (p + v’) tig 7% eee at. 
Hence, if ¥(p) is any completely monotone function in p = 0, then the function 
(40) g(r) = ¥((p + v*)' — ») 


is again completely monotone in p 2 0. 
When dealing with the sphere (38) we shall restrict ourselves to the class of 
functions (40), the constant v having the value 


k— 1 
> ° 


- 


y= 


This class is rather narrow; nevertheless it contains the functions (8) for 
0 <¢ <». In fact, the corresponding function ¥(p) is 
[e+ vr) +e - oT" 


and this is identical with 
(41) (p? — ¢) f ee” sinh (* — c)*t dt. 
0 


7. In our present case the Green’s function G(é, x) depends only on the 
geodesic distance between the two points £, x, both on the sphere (38). De- 
noting this distance by r,0 <r S z, we have as an analogue to (10) the formula 


2 


(42) G(r) = .¥ (n + v)go(n(n + 2v))P)’’ (eos r). 


n=0 


The spherical harmonies P,”’ can be defined either by 


(43) > PS? (cos rw" = (1 — 2w cos r + wy” 
V n= 
or by 
(44) a (n + v)P“ (cos rw" = (1 — w*)-(1 — 2weosr + wy". 
n=0 


Now, by (40) and (39), 
g(n(n + 2v)) = Wn) = I e ™ da(t) 


and therefore, by (44), 


Gir) = | (1 —e “) da(t) 


tar 
(45) (1 ne 2 cos r et + e2t)r+1 > 
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Since 
(46) 1 —2cosr-e' +e" = (1 — ec) + Al — cos rye“, 


the function G(r) is monotonely non-decreasing for decreasing values of r. 
About its behavior in the neighborhood of r = 0 we shall deduce the fol- 
lowing theorem. 

THeoremM. The assertions 1, U, Il of §3 are true if in their wording the 
function H(r) is replaced by the function (42) and the function ¢(p) by the func- 
tion (40). 

We first observe that for the present theorem the function (45) is equivalent 
to the function 


tda(t) 


(47) sinialas I errr" 


This follows easily from the fact that, for each « > 0, the differences between 
the functions G(r) and 7'(r) and the corresponding functions 


; , (1 — & *) dat) ‘ *  tda(t) 
G(r) = — -—-- j= ‘ mel 
(r) | (1 — 2eosre 4 e-2tyrtt? T(r) [ (@ 4 p2yett 


are bounded in 0 <r S 2/2, and that 


- G(r) -- Gr) 
1— ne) s lim "sg re ot ), 
We) 3 3 ar = arg 2'+* 
where lim n(e) = 0, since (see (46)) 
e—U 
t 2 ‘ — cosr 

ig a cis ee a 8 

i—0 t t--0 2t r—0 me 
Now, since vy + 1 = 3(k + 1), the function (47) differs only by a numerical con- 
stant from 
(48) | a | W[(ni ove + nz)'] exp [2ri(ni a1 +++ + npay,)|dny-+-dny,, 


where aj + +++ + a = r and y(p) is our present function (39)." But (48) is 
the function (18) with ¥(p') in the place of g(p). Hence the assertions I, I, II 
are true if we replace H(r) by (42) and ¢(p) by ¥(p'). Obviously, for p > «, 
the function V(0') can be replaced by the function (40), and this completes the 
proof of our theorem. 

In a similar fashion we could prove that ¢(A)f transforms analytic functions 
into analytic functions and that the function ¥(p) need not be bounded in the 
whole half-line 0 < p < «. 


® See S. Bochner, Fouriersche Integrale, p. 189, formula (21). 
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8. For g(p) = (p + c)', 0 < ¢ < » we obtain for (45) (see (41)) 
G(r) = | ~ sinh t+ sinh (»* — o)! t- dt 


(cosh t — cos r)’*! 


‘ 


where “=” denotes equality up to a factor independent of r (but dependent 
on k and c). Henee, by partial integration, 


“ue 2__ ,) 
(49) G(r) = cosh (»' — c) tde 
Jo (eosht — cos r)’ 
This formula remains valid for ¢ > »’, in which case 
Gir) = :: cos (c — v®)* t dt 
0 


(cosh t — eos r)”" 


* 


For y = }, that is, k = 2, we hence obtain, writing ¢e — v° = p’, 
G(r) = Qos.) (cos r) + Q-4_,, (Cos r), 
where Q,(cos r) is a spherical harmonic of the second kind.”” For 0 < ¢ < 4, 
the connection between G(r) and the standard function Q,(eos r) is more 
involved. 
For k even and =4 we put in (49), (»* — ec)’ = n + 4, u = sinh 3t, z = sin 4r, 
and obtain 
in * ,(u) du 
G(r) = a ) vga 
Jo (u®+ 2)" 
where 
cosh (n + 3) 


’,(u) = 
cosh 3 


If n is an integer, that is, for 


OS (oa) Ce) ee) 


,(u) has the special form 
a +au +++ +a,u", 
and therefore 


b,, 
G(r) = = + 3 + LJ seed a 


~ ~ oe ~ 


in particular 
1 ‘ . 
G(r) = pk? (ay + ar + or" ae a3r —- eee), 


1K. W. Hobson, The Theory of Spherical and Ellipsoidal Harmonics, 1931, p. 275, 
formula (149). 
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This expansion is remarkable for the absence of terms containing log r which 
one would expect to be present in an expansion of G(r) around the origin. It 
can be shown that for values of c other than (50) the logarithmic terms actually 
do occur, but a further investigation of this question would have but little to 
do with the topic of the present note. 


PRINCETON UNIVERSITY. 




















AN ANALOGUE OF THE VON STAUDT-CLAUSEN THEOREM 
By LEONARD CARLITzZ 


1. Introduction. Let GF(p") denote a fixed Galois field, and x an indeter- 
minate over the field. The function’ 


(1.1) y=) = >, — f'. 
where 
(1.2) [i] = 2” — 2, F; = [afi — 1)" --- ye" aa F, = 1, 


is closely connected with the arithmetic of polynomials in the GF(p"). In this 
paper we study the coefficients in the reciprocal of (1.1), more precisely in t/y. 
In particular we shall be interested in proving an analogue of the von Staudt- 
Clausen theorem for these coefficients. 

In order to define properly the coefficients in the reciprocal it is necessary to 
define a “normalizing” factor (analogous to x! in ordinary arithmetic). This 
is done in the following way. Let 


m= a tap" +--+: +a,p" (0 S a; < p”) 
be the canonical expansion of m to the base p"; then we put 
(1.3) qg(m) = FeeFT' «++ FS, q(0) = 1, 


where F’; has the same significance as in (1.2). Thus for example 
q(p"") = F., g(p™" 7 1) one (Fy om 7". 
We may now define the coefficients of the reciprocal by means of 
t —~ B 
(1.4) = a heel t. 
y > g(m) 


the summation obviously containing only terms in which m is a multiple of 


p” — 1. Clearly By = 1 and B,, is a rational function of x. The analogy 
between B,, and the ordinary Bernoulli numbers is brought out by the relation’ 
l Rs ‘ 
m= =a § (p" — 1|m), 
E™ —_g(m) 


where the summation is over all primary polynomials F, and 


p™*/(p"™—1) 
é = lim -~—— , 
‘° >a. 8-:- 
Received July 21, 1937. 
‘ See this Journal, vol. 1 (1935), pp. 187-168. This paper will be cited as DJ. 
2 DJ, p. 161, Theorem 9.3. 
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In the present paper we discuss some of the arithmetic properties of B,,. Our 
principal result is the analogue of the von Staudt-Clausen theorem for the Ber- 
noulli numbers. We find that 


Aut, +¢ 


] ; , 
deg P=/: P (p" - 2), 
where G,, is some polynomial (whose precise form is not determined), ¢ is an 
integer, not divisible by p, and the summation is over all irreducible polynomials 
P of degree k; finally k is a number depending on m whose existence depends 
on a certain set of conditions (see (7.3) below); if the conditions are not satisfied 
B,, = G,, and is therefore a polynomial. When p" = 2 the result must be 
modified slightly. 

The method of proof depends on certain ideas due to A. Hurwitz.’ While 
the proof is not particularly difficult, there are a number of details that make it 
rather long. In particular it is necessary to prove certain lemmas on g(m) 
which are of some interest in themselves. 


2. Lemmas on g(m). 
THEOREM 1. For m,, m. 2 O, the quotient 


g(m + me) 


(2.1) g(my)g(me) 
is integral (that is, a polynomial) .* 
Let 
(2.2) my = Bo + Bip" + --> + Bep™ (0 SB: <p’), 
Mm = yot yp +--+ +y.p” (0 < vi < p’); 
then 


(2.3) my + me = (Bo + yo) + +++ + (Bs + ¥.)p™ (0 < Bi + yi < 2p"). 
If now we put 
Bo + Yo = a@ + bop’, 


where dy = 0 or 1, and 0 S a < p”, we may define 6; , +++ , 6, recursively by 
means of 


II 


60 + By + Yi 
(2.4) 51 + Bo + yo = a2 + dep", 


a, + dip’, 


3 Mathematische Annalen, vol. 51 (1899), pp. 196-226 (= Mathematische Werke II, 
Basel, 1933, pp. 342-373). 

‘ Throughout this paper the word ‘“‘integral’’ will be used to denote a polynomial in x 
with coefficients in GF(p*). 
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where cach 6; = 0 or 1, and 0 S a; < p”. Thus (2.3) becomes 
my + me. = a + amp" +++: tap” + bp”. 
Hence by the definition of g(m) in (1.3), 
g(m, + m) = FY' --- FoF *s,. 
Comparison with (2.2) leads at once to 


gQm, + me) e. YE pei-0i-1 
- = I at I ’ ’ i 
g(my)g(me) vii II . 


Sinee by (1.2) Fu. = [s + 1]F?", the right member 


s—1 
[s + i}: peat tee Ba—Ye II Far Yj 
j=! 


s—l 
= [s + ee 1 II pei Ye 
j=l 


by the last of (2.4). Again 


phon prg tint 2 paper rte, 
Proceeding in this way we have finally 
(2.5) g(m, + me) = [s + 1} *fs}m ~~ [ape 
~~ q(m)g(me) 


Thus we have not only proved that (2.1) is integral but have derived the explicit 
formula (2.5). 
As an immediate corollary it is evident that for m, +--+ ,m, = 0, the quotient 


(26) qim + +++ + m,) 
g(m,) «++ g(my) 
ts integral. 

If in (2.6) we take m, = --- = m, = m, we see that g(km)/g*(m) is integral. 
For later purposes it will be necessary to know that this quotient is divisible 
by g(k). But we may prove without difficulty the following slightly stronger 
result. 

TuroremM 2. Form = 


a 
— 


, define » = ulm) by means of 
m= a + ap" +--+ + ap” (0S a <p’), 
B=a@etat::: ta. 

Thus form 21,4 21. Then the quotient 


g(km) 
q*(m)g*(k) 


(2.7) 


is integral.” 


> Compare Bachmann, Niedere Zahlentheorie. 
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It will be convenient to take first the special case m = p"', k = p"’, so that 
uw = 1. Weshall require the 

LemMa. For alli, 7 = 0, the quotient 
Fi; — fi+ii+7—-1) 


spri 
re" PF, F; 


d 


yn s n(j-—1) 
- -++ [i+ 1)” 
(2.8) (i, j) = (+P 
is integral. 
This is easily proved by induction. From (2.8) follows 


(+P _ (A+ FE 


(1,7) = = = = 
J F? 7 F; F? F; 
ee Secae xs ‘ 
= (i, j sae 1)’ + ( +t?) (7 gor l,j). 
Fi-1 
Since (¢, 0) = 1 = (y, O) it is evident from this recursion formula that (7, 7) 


is integral for arbitrary non-negative 7, 7. 
Returning to the general theorem, we now suppose m arbitrary but to begin 
with again take k = p"’. From (2.7) and the definition of g(m) it follows 


easily that 


g(p"m) 


ra ee 


Replace m by p"m and this becomes 


eC 
q’ (p"m) 


combining the last two equations we have 


g(p "m) 


mow = Us + Mls + UP) - (QI). 
gq’ (p"m) 


Continuing in this way we see that 


7) ta er, 


(2.9) 4 - 
gq?’ (m) i=0 
If we compare this with (2.8), it is clear that the right member is a multiple of 


F;° Pot oe = Fk y 


so that the theorem is proved for the special case k = p”’ 
In the next place, for 8 2 1, it follows that 


g?(p™ m) 
om) * 


(2.10) i? 
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But by Theorem 1, g°(p"’m) | g(8p"’m), so that (2.10) implies 
g(Bp™ m) 
gr?" (m) ’ 
thus the theorem is proved for k = Bp"’, where 0 < B < p”. 
Now take 
(2.12) k = ap" + Bp” (0 < a, 8B < p";i $j). 
Then by (2.11) we have 


(2.11) Fe 


pee poe g(ap™ m) g(Bp™ m) - 
i ] gn" ‘ (m )r”’ (m) , 
but by (2.12) and (1.3) 


g(ap"’ + Bp") = FIP}, 


and by Theorem 1, 


g(ap"'m)g(Bp"’m) | gi(ap"' + Bp”’)m}, 
so that 
g(km) 
q’(k) e 
g*(m) 
for k as in (2.12). Proceeding in this way we see that Theorem 2 is true 
generally. 


3. H-series.” If in the series 


249" 
(3.1) icc. 

m=o g(m) 
the coefficients A,, are integral, we shall call (3.1) an H-series. It follows at 
once from the definition that if S and S’ are H-series, then AS + A’S’ is also 
an H-series, where A and A’ are any two polynomials in x alone. As for the 
product of two H-series, if A, , A}, are the coefficients of S and S’ respectively, 
and C,, denotes the general coefficient in SS’, we evidently have 


g(m) 14! 


C,, = : ~_ A;A;. 
itjem g(t)g(j) : 


By Theorem 1, the g-quotients on the right are integral so that C,, is integral. 
Therefore the product of two H-series is itself an H-series, and generally for 
the product of any number of series. 

Consider next the reciprocal of S. In general this is not an H-series. If 
however Ag = 1 or any non-zero element of GF(p")— then the reciprocal is 
tlso an H-series. Thus for Ag = 1, put S = 1 + S,; then 


l | 


= =1-S:+Si--:::, 
S 1+58, it 1 


‘Compare Hurwitz, loc. cit. 
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and it is clear from the above that the right member is an H-series. Finally it 
follows that in this case S’/S is also an H-series. 
Of special interest is the series 


x 


Ye 
3.2 i = =) 
(3.2) : au g(m) ’ 


which has no constant term. We have seen above that for the k-th power, 
x y m 
k ( mt 
S = ; 
‘ m=k g(m) . 
C,, is integral. We shall now show that C,, is a multiple of g(k), or what 
amounts to the same thing, we prove 
TuroreM 3. If S, is an H-serics without constant term, then Si/g(k) is also 
an H-series. 
Assume first k = p"’. Then by (3.2), 
~*~ ’ p"* gmp" ni 
Ca > At . gp" m) 
r g(p"m) g” (m) 


But by Theorem 2, the second fraction on the right is divisible by g(p"') = F,. 
If then we put 


od a 
(3.3) =? eS 


g(m) 
we have F; | C,,. Squaring both sides of (3.3), we write 


y! am 
2pni a 2 q(m 
Sy = - where C,, = gm) 


q(m)’ etyam gle)g(f) 

Thus it is clear that F? | C.. , in other words see" /F? is an H-series. Similarly 

for S””’/F¢ . In other words, S}/g(k) is an H-series for k = ap"',0 <a <p". 
Suppose next that 


y 
e 


(3.4) k = ap" + Bp”’, 0<a,B < p’,i ¥j. 
Put 
a > A, ; a > A,,t . 
g(m) © g(m) 
so that 
(3.5) F7|A., F? | AS. 
Then for 
si - ts C,,t a g(m) F ., -f 
g(m) ctf=m gleg(f) 


so that by (3.5) 


sa pw ’ 
Fi F;\C.. 
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But by (3.4) 


FFF; = g(ap"' + Bp”) = g(k), 


so that the theorem is proved in this case also. It is now clear how the theorem 
may be proved for general k. 

As a corollary of some interest, we state: if an H-series without constant term 
be substituted for t in the H-series (3.1) and the result written as a series in t, then 
this series is also an H-series. 


4. Some theorems on B,,. If we define AS” by means of 


x (k) 
yr l _ > A m r 
_— ’ 


m= prk—] g(m) 


then we have the formula’ 


An’ 

4.1) B,, = OM ” 4 

where 

(4.2) L. = (k){k — 1) --- (1), Ly = 1. 


Now A.’ is integral and by Theorem 3 is a multiple of g(p™“ — 1). But by 
(1.3) and (4.2), 
(4.3) g(p" —1) _ (Feo Fh 1 
ly Ly-1 [k] 

If we recall that [k] is the product of the irreducible polynomials whose degree 
divides k, it is clear that if the left member of (4.3) be reduced to its lowest 
terms, then the factors of the denominator are simple. Further, except for the 
case p" = 2 = k, the irreducible factors are all of degree k. This proves the 
following 

Tueorem 4. Jf B,, = N,,/D,, , where N,, and D,, are relatively prime, then 
D,, has only simple factors. 

In the next place from the identity” 


(4.4) ry(t) — (at) = y’"(b) 


follows 


as 7% ~ ¢ _,woO-va _woO_ wo _ yon 

~~ Prt) vit)  vViQatvit) Vat) «r—y"(t) “g(m) 2’ 
where, by the discussion of §3, the coefficients C,, are clearly integral. Hence 
by (1.4) and the last theorem, it follows that the product x(x" — 1)B,, is integral. 


* DJ, p. 158, formula (8.11). 
* DJ, p. 150, formula (5.09). 
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But this result may be extended considerably. If G is an arbitrary poly- 
nomial of degree s, say, then in place of (4.4) we have the more general formula’ 


(4.6) Gui) — (Gt) = Aw” (0), 
j=l 
where the A; are integral; indeed 
oe j es ee 
A; = Pr, (GQ), yi(u) = > (—1) FL Pe . . 


Thus (4.5) becomes 


Gt t _ ,Gvio- VG) _ tyawr” 


ua) vO) VG@W0® G-YaAw"™ 


Now both numerator and denominator on the right are H-series, and it is easily 
seen that if we replace t by Gt the quotient becomes an H-series. Thus applying 
Theorem 4, we have 

TuHeorEM 5. For G an arbitrary polynomial, the product G(G" — 1)B,, is 
integral. 

Assume now in the notation of Theorem 4 that P| D,,, where P is an irre- 
ducible polynomial of degree k. Since in the theorem just proved, @ is quite 
arbitrary we may take it equal to a primitive root (mod P). Now by the 
theorem, 

G" = 1 (mod P), 


and therefore because of the nature of G, m must be a multiple of p™ — 1. 
This proves 

TueoreM 6. If P is an irreducible divisor of the denominator of B,, , then 
p™ — 1 divides m. 

If we return to the definition of AS and make use of (4.1) and (4.3), it is 
now evident that for p™ — 1 not a divisor of m, A“ isa multiple of P. There- 
fore in determining the fractional part of B,, it is necessary to retain in the right 
member of (4.1) only those terms for which p™ 
the following 

THeoreM 7. For P irreducible of degree k, we have the congruence” 

rad 
prk—ajm g(m) 


~ 1 only. The formula (4.1) 


— 1|{m. We may now state 


(4.7) yr" ‘= (mod P) 


P ° . nk 
the summation extending over multiples of p 


reduces to 


(4.8) B,, = G,. + 


(k) 
An 


’ 
prelim Li 


* DJ, p. 151, formula (5.11). 


Awi™ A,,t” ; ee , 

1° The statement > : (m) = yi (mod P) is short for the infinite system of congru- 
g(m 

ences A,, = A, (mod P). 


g(m) 
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Pa ae . : 1k 
where’ G,, is integral, and the summation extends over such k for which p™ — 1 
divides m. 
~ yk —4 y ° ° ° 
5. A lemma ony” —. We now prove the following theorem, which is the 
most important point in the proof of the main theorem concerning B,, . 

TueoreM 8. For P irreducible of degree k, 


prkiyp (— 1)" pnki ait 
(5.1) y = 7 Fr t (mod P). 
ki 


In other words, in forming the (p"“ — 1)-th power of y(t) (mod P) we may 
ignore those terms in ¢””” for which 7 is not a multiple of k. To prove (5.1), 
we remark first that by Theorem 3, 

y”" =0 (mod P). 
Combining this with (4.7), we have the congruence 


ARO" se (=D)! ni 
” #0 (mod P). 
pk—tjim g(m) F; mod F) 


Picking out the coefficient of t” on the left, we get 


g(m) = 
at (—1) : A ni 
- ” 2, Fg(m om p™) ! 


~ 
— 


k 


We suppose hereafter that m — 1 is a multiple of p"’ — 1. But by Theorem 7, 


A ni = 0 for p“ —14m— p", 


that is, for 7 not a multiple of k. Therefore (5.2) becomes 


(5.3) z. (=5)".. g(m) A” acc = 0. 

prti<m Fy¢g(m — p™) 
Now if p™ / m it is easily verified that the quotient of g(m) by g(m — 1) 4 0 
(mod P). Indeed if p"™ | m, p"“*" 4m, the quotient = L,, as defined in (4.2). 
Hence for p™ 4m, (5.3) becomes 


) l i g(m) . 
- Awa =] —1)* 5 5 AL ns 
” L, or ( Fi3q(m _ p™) se, 


the summation extending over 7 > 0 only. 
, 1 
We next recall a result proved elsewhere: 


2 


(5.5) Aen = 0 for m > 1, 


which we shall apply to (5.3) in order to show that AS’ = 0 (mod P) for a 


Il 


1 @,, will generally denote a polynomial depending on the index m and not necessarily 
the same in all formulas in which it occurs. 
"DJ, p. 158, Theorem 8.3 
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certain set of values of m. First we make a slight change in notation. In 
(5.4) replace m by m + 1, so that 

(5.6) p —1|m, p Xm +1. 

Assume now that (5.6) holds but in addition p™ 
shows that Af” is a sum of terms of the type 


(m+ 2. Then clearly (5.4) 


(k) fe 
A m+1— paki (2 Ps 0); 


nki 


but according to (5.5), this vanishes unless, for some i, m + 1 — p“' = p™ — 1, 


that is, unless 
(5.7) m= p™' + (p™ — 2)-1. 
In the same way if 
pp" 4m +1, pm + 2, p|m+3 
(so that p"™ > 2), then two applications of (5.4) lead to a linear homogeneous 
expression for .1,,’ in terms of the type 


(k) 


A n-2—prki_pnki (i,j > 0), 
which vanishes unless m + 2 — p™' — p“’ = p™ — 1, that is, unless 
(5.8) m= p™' + p™’ + (p™ — 3)-1 (p™ > 2). 
Now in both (5.7) and (5.8) m is expressed as a sum of p™ — 1 terms p™". 
We shall now show generally that unless 
(5.9 = = gts + rer + gr * a »~ eS 1, 
) / i i 
AS = 0 (mod P). For suppose 
pim+i (@=1,---,@, 
(5.10) . ‘ 
p’*m+t+1 (t < p™). 


Apply (5.3) ¢ times and A? is exhibited as a sum of terms 
= w=m+t— pM —... — pp, 
Since by the second of (5.10) p™ | w + 1, follows from (5.5) that A“? = 0 
unless w = p™ — 1, that is, unless 
m= pa +---+p 


which is precisely the condition (5.9). 
. . om . pnk 4 ° 
It is now easy to establish the congruence (5.1). If we expand y’ directly, 
it is clear that a term involving t” occurs when 


(5.11) n= " 4 ‘ee 4 ol r= p™ cs l. 


nkis 


+ (p“ —t—1)-1, 


But by the result just proved the sum of such terms for fixed m will be = 0 
(mod P) unless m is of the form (5.9). But from this it follows almost imme- 
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diately that each j in (5.11) is a multiple of k. Hence in forming vy it is 
necessary to use only the terms in t” seat but as remarked at the beginning of this 
section, this is equivalent to (5.1). We have therefore established Theorem 8. 
As we shall see in the next section it is now possible to evaluate A®., In 
proving the von Staudt-Clausen theorem for the ordinary Bernoulli numbers, 
it is possible to make use of certain explicit formulas. Their analogues are not 
available, and therefore some such method as used here seems necessary. 


yprk_y y . 
: We now consider 


(6.1) (= wT ‘a 7 


Rewrite (5.7) in the form 


6. Further lemmas on y 


m= a + ap” +--+ +ap™" (a; = 0), 
(6.2) a 
p ~-leatat*:*' +a. 
Then it is clear that (6.1) becomes 
(p™ ens 1)! (ee a 


rs a) Pree +a ’ 
m ao!ay! ++ a! F; _— Fee 


the summation extending over all m for which (6.2) is solvable. Making use 
of (5.1) we see that 

, nk “= ! 
(6.3) a = (—1)Marte tan (P 1)! g(m) 


D al ae 
ao! :++ a! Fy’ +++ Fey 


where again m is of the form (6.2); for other m, AS’ = 0. All the congruences 
are (mod P), where as above P is irreducible of degree k. To determine when 
the multinomial coefficient in the right member of (6.3) is different from zero 
(that is, not a multiple of p) we use a theorem of Dickson's: If the coefficients 
a; in (6.2) are of the form 


(6.4) a= Ly aij p (0S aij; S p—1), 
then 

(ao, *** , a) -_ 
is prime to p if and only if 


(6.5) 2 au = p— i (j = 0,---,nk— 1). 


i=0 


% Annals of Mathematics, (1), vol. 11 (1896-97), pp. 75-76; Quarterly Journal of Mathe- 
maties, vol. 33 (1902), pp. 378-384, 
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If (6.5) is satisfied, then 


(—1)™ 
(6.6) (av, +**,a@) = TT (mod p). 
I] a:;! P 


We may therefore assume that m satisfies both (6.2) and (6.5). 
It is now easy to evaluate the g-quotient in (6.3). Put 


(6.7) m = Bo + Bip" + hp” +--- (0 < & <p’). 
Comparison with (6.2) gives 

k—1 
(6.8) a: = 2, BeisiB™ (i = 0, «++, 8). 


Comparison with (6.4) gives 
n—l 

(6.9 er? See 
c= 


Now by (6.7), gim) = FY' FS +++ , from which follows by use of (6.8), 


qim) 


Fi! - ++ Fe, 


Il 


(Ft +++ Fa')(FY* +++ Fer!) ++ 


a pra: +3 i+ “past Beret 


But by (6.9) and (6.4) 


so that the g-quotient 
(6.10) (F,--- Fa)" = (—1) t 


by Wilson’s Theorem and the fact’* that /; is the product of the primary poly- 
nomials of degree k. Therefore by (6.3) and (6.6) we conclude that for m 
satisfying both (6.2) and (6.5), we have 


> A k la klay om? ( 1)“ : 
(6.1) he (—]) ; (mod P); 
I] a«i;! 


in all other cases, A,, (0). 
Lat m be fixed; we shall now show that at most one value of k can be found 


for which (6.2) and (6.5) are simultancously satisfied. Yor assume the relations 
nl tal 
m Yor MUP Tr*°* +p 
p P= +H + °** HM 


gulletin of the American Mathematical Society, vol. 3% (1982), pp. 756-714; also DJ, 
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(6.12) — 
Yi 


p-l= Do vi. 


Then from (6.2) and (6.5) follows 


s nk—-1 
z: z. ai; = (p — 1)nk; 


i=? j=0 


(0 = Viv 4 aed 1), 


II 
~ 
<: 
— 

~ 


on the other hand from (6.12) follows 


ni—l 


> » is Bi; = (p — I)al. 
i=0 j=0 
But clearly the a;; and 8;; coincide (except for numbering), and therefore 
k = l, as asserted above. 
We remark that there may be no value of & for which (6.2) and (6.5) hold. 
In this event B,, has no fractional part. 


7. The main theorem. We return to (4.1). In view of the last result in §6, 
(4.1) becomes 


(7.1) B,, = Gn + — Ax’, (p” = 2) 


provided k exists satisfying both (6.2) and (6.5); otherwise (7.1) is simply 
B,, = G,, , so that B,, is integral and nothing further need be said. Assuming 
then that a & exists, we make use of (4.3). If we exclude for the moment the 
case p" = 2, it follows that the irreducible divisors of the denominator of B, 
are all of degree k. Now it is easy to show (see the remark immediately following 
(4.3)) that 

l ad 

(k] wee rye P’ 


where the summation extends over all irreducible P of degree a divisor of &, 
and 2?’ denotes the derivative of P. By (4.3) we have 


ky ub) , 
(7.2) An Gi, A. . 
Ly Li, 1 deg Pek P 
the summation now extending over irreducibles of degree k only. But’ 
(—1)"" Law ar (mod ?), 
so that (7.2) becomes 
AS Ge ( 1) 1 qc Ss l 


D0, po 166, formula (iE 10) 
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Finally, making use of (6.11) and substituting in (7.1), we have 
THEOREM 9 (p" # 2). For given m, the system 


nki nk--1 
n= a;jp , p Qi, 
i=0 


ll 


i=0 
(7.3) nk=1 s 
ag = : aijp’, eins l= >> Qij, Qijez 0, 
j=0 i=0 


is either (i) inconsistent, or (ii) consistent for a single value of k, in which case the 
a; , a;; are uniquely determined. In case (i) B,, is integral; in case (ii) we have 
(ag Oren ---t00 y 1 

[J a)! deg P=k P™ 


7 ° 


(7.4) B, = Gn - 


Here G,, is integral and the summation is over all irreducible P of degree k. 

It remains to consider the excluded case p" = 2. We may no longer conclude 
from (4.3) that the denominator of B,, contains only irreducibles of degree k; 
polynomials of the first degree may also occur. To decide when this happens 


we examine 
,3 t t* 2 t' r 
y (+5 +h + N( + et mt ), 


Clearly A,”’ is different from zero only when m is of the form 2% + 2° > 2. 
Since in this case 
F.F3 for a # 8, 
g(m) = 
Past for a = B, 
it is evident that for a # 8, 


s 


l if 
\FaFs 1 FsFe 1 
= [6] + le] = 2” + 2°, 


A® = g(m) ¢ 


while for a = 8, 


Aj’ = g(m) = [a + Ifa). 


J 
2 
FF 


(2 


Thus for a = 8, it is clear that A,,’ is a multiple of Le. 

For a # 8, there are several possibilities. Ifa > B > 0, then A®? is divisible 
by (z* + z)*; but if a > B = 0, then A’ is divisible by 2° + x only and the 
quotient A,’ /(z* + 2) is congruent to 1 (mod z* + x). Again for a = B (mod 2), 


Ax is divisible by 2° + 2 + 1, while for a A B (mod 2), AL? 1 (mod 
x +2+1). We now note that if the system (7.3) is satisfied for p" = 2 = k, 
then it follows at once that m = 4° 4+ 2-4’; in other words, this is the case 


a # B (mod 2). Also it is easily seen that any other value of k is inconsistent 
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with m = 2° + 2° (a > 8). Hence we have the following supplement to 
Theorem 9: 
THEOREM 10 (p" = 2). If the system (7.3) is consistent for k # 2, then 


7 E ’ 1 
(7.5) B, = G, + eo Pi 
if k = 2, then for m even, 
(7.6) B,, = G,, + ’ 
w+rt+il 
while for m odd, 
l l 1 l 


7.7) B,, = G,, ~ : 
(7.7) + x chi Peael 


If (7.3) ts consistent for no value of i, then for m even, B,, = G,, , while for m odd, 


l 
r+1° 

The following remark may be useful in testing (7.3). We assume m fixed: 
pick a k such that p"™ — 1|m. Then (because of the condition 0 < a; < p™) 
the @; are uniquely determined. If their sum is not p™ — 1, we go no further. 
If however the sum is equal to p™ — 1, we use the equation a; = 2, a p’ to 
determine the a;; (because of the condition 0 < a;; < p, the determination is 
unique). It is then only necessary to check the system of equations 


p-—l= Qj. 
F 


A partial cheek on Theorems 9 and 10 is furnished by the case m = p 
. . ee 16 - ° . 
for here a simple explicit formula” is available for B,, : 


ate 


a : + 


, l 
’ 


q(m) (Fy s+ Fy)? 


7.8) B,, = = 
« Le Ly 
Kor p’ = 2 = k, this reduces to 
l 1 
Catt 


> — , 
(2) xr wr+1 r+r+1 
which agrees with (7.7). In all other cases the irreducible divisors of the 
denominator of B,, are of degree k, and it is evident that (7.8) is in agreement 
with (7.5). For this value of m, it is clear thats = 0,a) = p™ — lay; = p 5 
AS = g(m). 


Dune UNIVERSITY 


"DJ, p. 159, formula (9.02) 








SUMS OF VALUES OF A POLYNOMIAL MULTIPLIED BY CONSTANTS 


‘ 


By KENNETH S. GHENT 


. 
1. Introduction. We seck conditions on the integer s, on the sets of positive 
integers (a; ,--: ,a,) and on the coefficients of a polynomial P(x) for which 


the Diophantine equation 


(1) n= > a,P(h,) 


y=] 


is solvable in integers h, 2 0 for every integer n sufficiently large. By n suffi- 
ciently large we mean that n is greater than an existing constant b; which 
depends only on s, a; , ++: ,a, and on the degree k and the coefficients of the 
polynomial P(r). We consider two cases of the polynomial P(x): 


(2) P(z) = a(x’ — r)/6 + b(a” — x)/2 + er +d, 
where a > 0 and a, b, ¢ and d are integers;' 


P(x) = ax(x + 1)(2 + 2)(2 + 3)/24 + br(a + 1)(2 + 2)/6 


(3) 
+ cr(x + 1)/2 + dr +e, 
where a > 0 and a, b. ¢, d and ¢ are integers.” 
For a, = --: = a, = 1, the problem is the classical Waring problem for 


third and fourth degree polynomials. If P(x) in (2) is such that a # 4c (mod 8) 
James has shown that every sufficiently large integer n is a sum of nine values 
of P(x) We show that for s = 9 and for integral constants (a), «~~ , a.) 
satisfying certain congruential conditions given later, every sufficiently large 
integer can be expressed in the form (1). For P(x) as in (3), Miss Humphreys 
has given conditions which are sufficient to prove that every sufficiently large 
integer n is expressible as the sum of 21 values of P(x). Under certain further 
assumptions on P(r) and on the sets of positive integers (a, ,--- , a.) we shalj 


teceived June 26, 1936; in revised form, June 28, 1937. 

/ A cubic polynomial in z is an integer for all integers z 2 0 if and only if it is of the 
form (2) R.D. James, The representation of integers as sums of values of cubic polynomials, 
American Journal of Mathematics, vol. 56 (1934), pp. 303-315. See also D. H. Hilbert, 
Uber die Theorie der algebraischen Formen, Mathematische Annalen, vol. 36 (1890), pp 
511-512. 

* A fourth degree polynomial in x is an integer for all integers 2 2 0 if and only if it is 
of the form (3). See M.G. Humphreys, On the Waring problem with polynomial summands, 
this Journal, vol. 1 (1935), pp. 361-375. The proof is accomplished by a slight modifica 
tion of that of Hilbert in the paper previously cited 


* James, loc. cit 
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show that every sufficiently large integer n can be expressed in the form (1) 
for s = 21. 
The proof depends on certain analytic theorems and on certain congruential 
r ° 4 5 
theorems. The analytic theorems follow closely those of James’ and Landau. 
» ° ° 6 . ° 
We quote the analytic theorems without proof; the congruential theory is 
given in detail. 


2. Notations. The following notations are used throughout this paper. 
Further notations will be introduced when required. 


K=2"; @21-1/K; fiz) = D2; 


h=0 
r(n) = r(n, k, s, a3, +++ , as 3 a, b, c, d, e) = the number of solutions of (1); 
p denotes a primitive q-th root of unity; 
rerm 
Y a,P (hr) 
S(a,, p) = - p ; 
h=r+l 
’ a, P(A) 
S», = . pe ’ 
h 
where A ranges over a complete set of residues mod q; 
l To 
Add =— UITS,,0": 
7° plq) 1 
x 
Go = Sa = oe Ao(q) will be referred to as the Singular Series; 


« 
v 


(a, b) = g.c.d. of a and b. 


3. Principal analytic theorem. Consider the following polynomial 
(4) (x) = aor + ar’ +--+ + ae, 


where ay > 0 and a, a, *** , ay are integers. The value of s considered will 

be that of the first Hardy-Littlewood theory. The notations of the previous 

section are used with P(x) replaced by @(7). Consider also equation (1) with 

P(x) replaced by (7). The theorem which follows can then be proved for the 

polynomial &(x) of degree k by a generalization of the proof given by Landau.’ 
Tueorem 1. Fors = (k — 2)K + 5 


ag" T'"(1 + 1/k) 
(a, «++ a,)*/*T(s/k) ~° 


> _ s/e-1— By 
r(n) — n < Con” : 


‘ James, op. cit. 

*E. Landau, Uber die neue Winogradoffsche Behandlung des Waringschen Problems, 
Mathematische Zeitschrift, vol. 31 (1929), pp. 319-338, 

© Proofs of the analytic theorems are given in the writer's doctoral dissertation at the 
University of Chicago, August, 1985 

’ Landau, op. cit 
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where Co is a constant depending only on ®, s and max a, and By is a constant 
depending only on k. 


4. Primitive solutions. Consider (x) as in (4). Let @ be the highest power 
of the prime p which divides every coefficient of &’(x). Then 
(x) = p &'(x) . 
has at least one coefficient prime to p. We define y as follows: 
6+ 1 for p > 2, 
6+ 2 for p = 2. 


II 


7 


Let Mo(m) = My(m, n) denote the total number of solutions of 
(5) ts a,?(z,) = n (mod m), 0s x, <7. 
v=] 


For m = p’, let No(p', n) denote the number of solutions of (5) for which 
p/* g.c.d. of %(2,). Let N.,(p', n) denote the number of solutions with p/ g.c.d. 
a,%(z,). The last solutions defined will be referred to as primitive solutions. 
By generalizations of the work of Landau and James previously referred to, 
the following three lemmas can be proved. 

Lemma l. Ifl = y then 


N.,(p') = p ”*-’N.,(p"). 
LEMMA 2. 


M.(m) = m"' > A,(q). 


Lemma 3. If p, denotes the h-th prime, then 


2X Aoly) = IT 2 Aolg). 


<p ap! 
aipt--+p! PSpt ait 
1 t 
The following lemma is also required. 
LemMaA 4. Fore 20 
ld4t+e 


Ao(q) > —kaq” ’, 
where Ez, is a constant depending at most on k, 8, ag, +** Qe, A, °°" 5 a. 
5. Application to the cubic and to the quartic. The analytic theorem and 


the lemmas that we have stated apply to a polynomial b(x) of degree k with 
integral coefficients and leading coefficient positive. The polynomials P(x) of 


(2) and (3) have leading coefficient positive but the coefficients are not integers. 
Hence, for these cases, let Q(z) Q(x; v, t) P(2v + t), where t = O and 
v > O are definite integers depending on the coefficients of P(x) in the respective 
cases. We choose » so that it contains the least number of factors required to 
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make Q(x) a polynomial with integral coefficients. The choice of t depends 
more specially on the case under consideration. For Q(x) we define 6 as before. 
For both the cubic and the quartic, p > 3 implies @ = 0 since p does not divide 
all coefficients of P(2). 


6. Introduction to the congruential theory. The main portion of this paper 
is devoted to the development of certain congruential theorems® for the poly- 
nomials (2) and (3). We are then able, with the aid of the analytic theory 
introduced, to prove the principal results given in Theorems 3 and 5. We 
assume that the cubic is not of the type excepted by James’ and that the quartic 
is not of the type excepted by Miss Humphreys.” 

The following definition will be used: For a prime p and a positive integer l, 


a set of positive integers (a, ,--- ,a,) has the property S(p, p’) if one of the 
set (a, ,--- ,a,) which we may designate by a; is prime to p and if the 
a; (¢ = 1,--- , 8; 7 # 9) are such that for every integer n there exists at < s 
for which 

t 

Da; =n (mod p) 

i=] 


is solvable. 


7. Results for cubic polynomials. We seek conditions on the coefficients of 
P(x) in (2) and on the constants (a; , --- , ag) under which the equation (1) is 
solvable in integers h, = O for all sufficiently large integers n. We may evi- 
dently assume d = 0 in (2). We may also assume that a, b, c have no common 
factor other than unity; for if p | a, p|b and p|c, then p| P(x) and ¥ a,P(z,) 

vel 
would represent only multiples of p. 

We first prove Theorem 2, a congruential theorem for the cubic; and then by 
Lemma 5 we obtain Theorem 3, the final theorem for the cubic. We state these 
theorems as follows for P(x) as in (2). 


THeoreM 2. If the polynomial P(x) and the set of positive integers (a, . ++ > , a.) 
salisfy the following three conditions: 
(a) for every prime p > 3, at least six of the positive integers (a, ,++* , a.) are 


prime to p; 
(b) the set (a; , +++ , a.) has the properties S(2, 2°) and S(3, 3°); 
(c) a # 4e (mod 8), 7.c., the polynomial is not of the type excepted by James: 
then for every integer n and for Q(x) = Pur 4+ b) there exist primitive solutions of 
the congruence 


s>o 


(6) ie a; Q(x) = n (mod p”)., 


i=l 


*'The methods of proof are similar to those of L. Eb. Diekson, Cyclotomy, higher congru- 
ences and the Waring's problem, American Jour. of Math., vol. 57 (1985), pp. 891 424 
* Loc, cit. 


1° Loe. cit 
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THEOREM 3. Lets 2 9 be an integer; and let P(x) and the set of positive integers 
(a; , +--+ , a.) be such that the conditions (a), (b), and (c) of Theorem 2 are satisfied. 
Then there exists a number Cy depending only on s, a, b, c and a,,--- , a, such 
that every integer n > Cw is expressible in the form 


s>9 


n= > a,P(z,), r, = 0. 


v=1 


8. Congruential theory for cubic polynomials. We first consider (6) for 
primes p > 3. In §5 we saw that 6 = 0 for p > 3, hence y = 1. Consider 
the polynomial Q(x) in the form 


(7) Q(z) = Aix’ + Asx” + As2, 


where A,;, Ae, As are integers whose g.c.d. is unity. Suppose, first, that 
A, =0(mod p). Then the congruence is a second degree congruence which can 
be treated directly. Hence suppose A; # 0 (mod p). Then we may evidently 
take A; = 1 (mod p) since, if we determine d by dA; = 1 (mod p), dn ranges 
with n in (6) over all residues (mod p). Let a, +--+ ,a@¢ be six members of 


the set (a;,---,a,) which are prime to the given prime p. Then each 
a; (( = 1, +++ , 6) satisfies one of the congruences in 
(8) a; = 2’, a; = gz’, a,;= gr (mod p), 


where g is a fixed primitive root of p. Hence without loss of generality assume 
(9) a, = ath G3 = ts (mod p). 
Case 1. AvA;(a, + d2)(a; + as) A O (mod p). Let rz = —mary. Then 


aQ(z;) + alQ(re) = ay(ujz; + 12) + As(aguizi + dere) + As(aguia, + are) 


3 3 3.3 3.2 2 3 
Oy( usa} — 4:7; ) 4 Aotte( Uy r} + u12)) + Agas(uyry ~ My Ty) 
Aotoluy + uy)ay + Agael(uy — m)a) (mod p). 
Similarly, let cz; = —wu3r3. Then 


a,Q)(a3) 4+ agQ(a,) Aoa,(u;, 4 M3)a3 + Ayay(uy Us)r, (mod p). 
Next” let a; X, + 2, where z is such that 
(2QAca(u; 4 M3) )z + Ayao(ui ;) 0 (mod p). 


This has one solution, since, from hypothesis, 2Agae(ui 4+ uj) is not divisible 
by p. Similarly, set rz; = X, + v, where v ts such that 


2Ava;s(us + uz)v + Ayala Us) 0 (mod p). 


Vf Ayay(u; uy) 0 (mod p), this transformation is unnecessary 
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We have then essentially 
4 


(10) > a; Q(x;) 


rr + sy =n (mod p), 


where neither r nor s is divisible by p. This congruence is solvable for every 
integer n. 

Case 2." AsA; # 0, a, + a2 = 0 (mod p). Let x = re + Ll, where lis so 
chosen that 31 + A. # 0 (mod p). Then 


a,Q(21) + aeQ(x2) = aQ(rte + 1) + aaQ(2x2) 
(a, + d2)Q (x2) + a;(2x3(3l + As) + 2.(30 + 2lA2 + As) 
+ (' + PA, + 1A3)); 


a,Q(21) + aeQ(r2) = a,(3l + Ao)az + a,(30 + 21A, + As)r2 + const (mod p). 
The remainder of the proof is as in Case 1. 

Case 3. AsA; = 0 (mod p). We show that in this case we can make a 
transformation, x = NX + 1, which carries Q(x) to 
Q(z) = X* + AsX? + AjX + const 

= X* + NX°(3l + As) + X(3P + 2Ael + As) + const 

in which AA; 4 0 (mod p). Suppose first that A, and A; are both divisible 
by p. Choose l prime to p and we have A3A; not divisible by p as desired. 
Suppose next that A¢ is divisible by p but that A; is not divisible by p. Now 


(11) (3F + As) = 0 (mod p) 


has at most two incongruent solutions for 1. Hence there are at least (p — 2) 
incongruent values of l for which (11) is not satisfied. Any one of these (p — 2) 
values of I makes A343 4 0 (mod p). Finally, suppose that Ae is not divisible 
by p but that A; is divisible by p. There is a unique value of / (mod p) for 
which 31 + Ae is divisible by p. One of the remaining (p — 1) incongruent 
residues (mod p) satisfies 


3l + 2A, = 0 (mod p) 
Hence there exist (p — 2) incongruent values of Ll for which 
(31 + Ae)(3l + 2A.) #0 (mod Pp) 


We use one of these values of Lin the transformation and we obtain AsAy ¥ 0 
(mod p). 
Henee tn all cases we can satisfy the condition that AgAs be not divisible by p. 
We require a primitive solution of (6); Le., at least one a,Qo(r) prime to p 
Suppose that no one of the aQ(vi) which we have used satisfies the desired 
condition. Then choose 25 so that asQo(vs) is prime to p. This we ean do 


"fay + ay 0 (mod p), the treatment is the same as when @ + @e 0 (mod p) 
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since we have just seen that if A; is divisible by p we can make a transformation 
on x to give Q,(z) in which A; is not divisible by p; and let 2; be divisible by p. 
Then a;(Q(zs5) is divisible by p while a;Qo(2s) = a5A3 (mod p) is not divisible by p. 

We have then shown that for p > 3 there exist primitive solutions of the 
congruence 


6 
D aQ(xi) =n (mod p) 
i=] 

if for every prime p > 3, a,,--- , a are all prime to p. We have thus proved 


Theorem 2 for primes p > 3. 

For p S 3, James used in the congruence (6) fewer than 9 equal values of the 
polynomial and, if necessary, one additional value of the polynomial which 
insured that the primitivity condition be satisfied in order to solve the con- 
gruence for an arbitrary integer n. If we assume that our set (a, ,--* , ds) 
has the properties S(2, 2°) and S(3, 3°), the discussion given by James applies 
to the present problem.” We have thus proved Theorem 3. 


9. Results for the quartic polynomial. We consider P(x) as in (3). As in 
the case of the cubic polynomial, we may assume that a, b, c, d and e have no 
common factor other than unity. We prove Theorem 4, a congruential theorem 
for the quartic similar to Theorem 2 for the cubic, and give in Theorem 5 our 
final results for the quartic. 

THrorem 4. Lf the polynomial P(x) and the set of positive integers (a, , +++ , as) 
satisfy the following conditions: 

(a) for every prime p > 3, eleven of the constants (a, , +--+ ,a,) are prime to p; 

(b) the set (a,;, +++ ,a,) has the properties S(2, 2‘) and S(3, 3°); 

(©) the coefficient of x in the normal form of the polynomial is not divisible 

by a prime of the form 4m + 3 (p > 3); 

(d) the polynomial is not of the kind excepted by Miss Humphreys, 

then for every integer n and for Q(z) = Plvx + t) there exist primitive solutions 


of the congruence 


21 
12, } a, Q(x) " (mod yp’). 
i=l 
Sunonem 5. Lets = 2) be an integer; and let P(x), the quartic polynomial in 
(3), and the set of positwe integers (a,, +++ , a.) be such that the conditions (a), 


‘by, (e), (d) of Theorem 4 are satisfied. Then there exists a constant Cy, depending 


ons, a,b, ¢, d and max a, such that every integer n > Cy is expressible in the form 


7 > a, Ps). 


v1 

‘Qur hypothesis makes Laeminas 12, 14, and 14 of James’ paper available for our dis 
cussion. The use of Lemma’ can be avoided. Sections 5 and 6 of James’ paper then give 
he Gerire result 


‘The normal form of the polynomial is discussed in the next section 
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10. Congruential theory for the quartic. Suppose that Q(z) is of the form 
Q(x) = Box* + Bix + Bex” + Bsx + By, 


where 8; (¢ = 0, 1, --- , 4) are integers and & > 0. This we can assume after 
transformation of P(x) in (3) by replacing x by vr + ¢t. As in the case of the 
cubic we may assume that 8) = 1 (mod p). Moreover, if k is prime to p (this 
is the case for p > 3) we replace z by X + z where z is so chosen that the coeffi- 
cient of X° is divisible by p. Finally, after this transformation, we may evi- 
dently assume that the constant term is zero and consider (12) for Q(z) re- 
placed by 


Q(z) = a+ Asx’ + Asx. 


We call Q(x) the normalized form of the polynomial Q(x). Then condition (c) 
of Theorem 4 implies A; # 0 (mod p = 4m + 3, p > 3). 

We consider the congruence (12) for primes p > 3. Then @ = Oandy = 1. 
Hence we consider 


s=21 


(13) De a;Qi(r,) = n (mod p), 
inl 
where we assume that (a; , +--+ ,@) are prime to p. By Lemma 39 of a paper 
15 . —_— ° 
by Huston,” there exist primitive solutions of 


4 
pe ahi =n (mod p). 


t~l 
Put 25 = hy (@ = 1,°-- , 5), ri = hye (6 = 6 --- . 10), where y and ye are 
prime to p and where the h; are so chosen that 


’ lo 


(14) > ahi = (= > ahi (mod p 


i=l oe 6 


and in each congruence at least one ajh; is prime to p. 
Case 1. AsAs # O(mod p). Suppose first that the values of Ay ( h.->+ 8) 


‘ ot ‘ — . . 
which we have chosen to satisfy > ayh; = 0 (mod p) also satisfy » ayh; 0 
i! en | 


> 


(mod p). Then by choice of hy or —hy we can have Ss ah, not divisible by p 


— 
i=l 
This is true since we may take ayy prime to p and since if (Ay. , As) satisties 
Ny o > 
Lahti = LVahi = Dah =0 (mod p), 
ie | | aml 
then (—hy, he, > s> Ay) satisfies the first two congruences but msures that 
> . . 
>: ayhy is not divisible by po Hence 


"RW. Tlusten, Asymptotic generalizations of Warcng’s theorem, Proceedings of the 
London Mathemationl Society, (2), vol. 80 (1985), pp. S211 
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D aiQi(xi) = LVashiys =n (mod p), 
i=l i=l 
which is solvable for every integer n. To satisfy the condition that the solution : 


be primitive it may be necessary to add one more summand, a¢Q;(xe), where 26 
is divisible by p while 





agQi (xe) = ac(4x° + 2Aox + A;) #0 (mod p), 
since p does not divide A3;. 
Suppose next that the values of h; (¢ = 1,--- , 10) which we have chosen 
to satisfy (14) are such that 
5 lo 
> ahi F 0, > a:h; #0 (mod p). 
i=l i=6 
Then 


10 By 5 
D> aQix) = A(X ashi) + A(X ash) 
i=l 


(15) i=l t=1 a 
+ a( ahi) ys + a(X uh.) Yo (mod p). 
i=6 i=6 


Put ye = Ye + Zs, where Z, is so chosen that the coefficient of Y, in (15) is 
divisible by p. Put y: = Yi + Z, where Z, is so chosen that the coefficient 
of Y,; in (15) is divisible by p. We then have essentially 


10 
> a;Q\(z;) = rYi + s¥3 =n (mod p), 
i=! 


which is solvable for every integer n since 


5 10 

. . 

r= Ag > ah; and 8s = Ag :s ah; 
i=l 1=6 


are both not divisible by p > 3. As before we may add one more summand, 
4:Qi(21;), divisible by p to insure that the primitivity condition is satisfied. 

Case 2. As = 0, A; # O (mod p). Evidently this case is equivalent to the 
first part of the preceding case since Az divisible by p has the same effect in 
the congruence as 


5 


7. ahi =0 (mod p), 


1 


Case 3. As A; 0 (mod p). Then 


> a,.Qy(a,) m= 0,07; + Gere + °°: + 25 7 (mod p) 


j 


has a primitive solution by Lemma 29 of Huston’s paper. 
Case 4. As # O, As 0 (mod p 4m + 1). Choose A; so. that 
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5 


5 
pa ah; = 0 (mod p) and > ash; # 0 (mod p). This can be done since & = 1 
i=l 


i=l 
(mod p) has at least one root & such that & #4 1 (mod p). Hence if 
(hy , ho. +++ , As) is a solution of 


| 


8 3 ’ 
> ahi = > ah? = 0 (mod p), 
i=1 


i=l 


then (fh; , he, +++ , hs) is a solution of > a;hi = 0 (mod p) but does not satisfy 


i=l 


> ah? = 0 (mod p). Use this latter set of h;. Then we can obtain 
i=l 


i=! i=6 i=l i=6 


5 10 BY 10 
. a, Qa) + om a;Qi(x;) = a(X ashi) ui + 4(X ashi) =n (mod p), 


0 
which is solvable for every integer n since neither r = A,( ait) nor y = 
i=l 


le 
4(2 a;h;; ) is divisible by p. 
i=6 

The proof fails when Ag is divisible by a prime of the form 4m + 3. This 
gives rise to the exception in the theorem. An additional summand, ayQ(ru), 
can be added if necessary to satisfy the condition that our solution be primitive. 

Hence for p > 3 we have the congruence (12) solvable for s 2 11, with the 
one exception noted. 

For p < 3 we assume that our set has the properties S(2, 2°) and S(3, 3°). 
This assumption reduces the discussion of this case to that of Miss Humphreys. 
Her proof for the quartic case, p S 3, a, = ag = --- = a, = | then applies to 
the ease under discussion. Hence we have Theorem 4. 


11. Proofs of Theorems 3 and 5. If So, 2 » > 0, where » is independent 
of n, it follows from Theorem 1 that r(n) > 0 when 


n> Ce = ((Casad “(ay ae a,“ T(s k)/nt(1 + 1 ky)". 


The proofs of Theorems 3 and 5 are thus reduced to the proot of 
Lemma 5. If for the cubic polynomial s = 9 and if for the quartic polynomial 
s 2 21, then 


Son 20 > O, 


where n is independent of n. 
° l , 1 , . -_ : 
Since Mo(p') = N,,(p) we have from Lemmas 2 and | and Theorems 2 and 4 
in the respective cases 


> Ay(q) p'""MA(p’) = p ”’ N.,(p’) 
(16) iP! bees 


p p Ase 28. 
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where t = 4 for P(x) in (3) and t = 3 for P(x) as in (2). 
holds since in the respective cases (l > 4 = y), (1 >3 = y). 


Then by Lemma 4 
with e = 1/8 


(17) p> Ao(q) = 1+ DL Ao(p') > 1 — E2 > pp” = 1 — Bp — 1)". 
a\p \=1 l=1 

Finally, the singular series is convergent for s = (k — 2)2*"' + 5 and hence by 

(16), (17) and Lemma 3 


S,. =lim D> Adq) =lim J] p 3 Ad(q) 


x 2 < | 
Fe sgt. -p! I~ psp, |p 
1 


IT max (p -?, 1 — Ee(p™® — 1)"’) 
p 


IV 


=2n> 0. 


UNIVERSITY OF CHICAGO. 


The final inequality 




















TRIGONOMETRIC APPROXIMATION IN THE MEAN 
By E. 8. QuapE 


The following theorem is stated without proof by G. H. Hardy and J. E. 
Littlewood.’ 

TueoreM. The class Lip(a, p) is identical with the class of functions f(x) 
approximable in the mean p-th power, with error O(n“), by trigonometrical poly- 
nomials of degree n. 

They remark in addition: This approximation may be made in general by the 
Fourier polynomials of f(x); the case p = ~ , in which this is not true, is exceptional. 

The initial purpose of this paper is to examine the range of values of p and a 
for which this theorem and remark are true and to supply proofs. In doing 
this, related theorems are obtained in which the approximations are in terms of 
the metric of a more extensive space than L, and in which the functions that 
measure the degree of approximation are more general than nn“. These 
theorems and their proofs parallel to a large extent the theorems given by de la 
Vallée Poussin’ and Dunham Jackson* for the class Lip(a). 

We assume throughout that our functions f(z) are periodic with the period 
2r. The functions &(u) and ¥(u) are of Young’s type.‘ That is, @(u) is non- 
negative, convex, and satisfies the relations (0) = 0 and ®(u)/u—> x asu— ~; 
W(u) has similar properties and is such that Young’s inequality 


uv < &(u) + Vu), u,v 20 


holds. Throughout the paper we will write ® ul ,Wiui,for®(u ),¥G uu) ). 
Qe 


If f(x) is measurable and such that ® f | dx exists, f(x) is said to belong 
0 


to the space Le(O, 27). Lf f(x) is such that the product f(x)g(z) is integrable 
for every g(x) « Ly, then f(x) «€ Le. For this space 


| f |le = sup [ S(xdg(x) dx 


for all measurable g(r) with p, = [ VW gi dx < 1. This space’ is linear, 
0 


Received October 5, 1936; in revised form, July 9, 1937. 

' A convergence criterion for Fourier series, Math. Zeit., vol. 28 (1928), pp. 612-634; in 
particular, p. 688. 

2 Legons sur U Approximation des Fonctions, Paris, 1919. W 
as (P). 

> The Theory of Approximation, New York, 1930. We shall refer to this treatise as (D). 

*A. Zygmund, Trigonometrical Series, Warsaw, 1935, §$§4.11, 4.142. We shall refer to 
this treatise as (Z). It contains extensive bibliographical references to original sources. 


6 (Z), S4.541. 


» shall refer to this treatise 
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metric, and complete. If f(z) « L$, we put, for 5 > 0, 


wold; f) = - . || f(z + h) — f) |le. 


For p > 1, L,isaclass L$. InL,,p = 1, 


Cle l/p 
w,(6;f) = sup ||f(r +h) — f(z)||, = sup (| | f(a + h) — f(x) "dz) , 
O<jh| sé O< hj <b 0 
If w,(6; f) = O(6%), 6 — 0, f(z) is said to belong to the class’ Lip(a, p). The 
limiting case of Lip(a, p), denoted Lip(a, ~) is identical with Lip(a). For 
brevity, we shall write | f || for | f ||}@ whenever it will not lead to confusion. 
In order to prove the next theorem we need the following two lemmas. 
Lemma A. If t,(x) is a trigonometrical polynomial of degree n at most, then 


’ 
tn @ S 2n\t, ®. 


This lemma is a trivial extension of a theorem of Zygmund’ that, when 
@(u) is a Young’s function and ¢,(x) a trigonometrical polynomial of degree n, 


then 
ws t. 2r . 
| ? dr < i ® | t, | dx. 
0 n 0 
If t.(z) = 0, th | = |) t, |) = 0. Assuming t,(7) 4 0, we have, sinee || é, || 2 


| t, dx > 0, by Young’s inequality and the above theorem of Zygmund 
0 


U ' on Ul 
tr "ob 
-= nsup | (x) g(r) dx 
; » nil 


f, 7] n 


" t 
<n ? dy + pg 
0 n i t,, | 
2s t, 
sn || a t. |! dar + m | 


- 


Since p, = 1 and’ 


the lemma follows 
This lemma holds also for the cases’ L,, p and p x Which are not 

* 

spaces of type La. 


* (Z), §§4.76, 4.77, 4.78 

7 A remark on conjugate series, Proceedings of the London Mathematical Society, vol. 34 
(1932), p. 396. 

*(Z), (4.541 

*(Z), $7.31 





BE ROM carry tis eT ee 

















TRIGONOMETRIC APPROXIMATION IN THE MEAN 531 


Lemma B. If f(x) is absolutely continuous and has a derivative f'(x) « L3(0, 27), 
then 


‘| f(x + h) — f(x) |le S 2hh\-\| fle. 
We have 


A +H) — HO = iAL-isttidoup| [~ 9@| + MESO) a) 


hii f’|| 
<jh|-\\f") [ ot — IO a 4 1) 
o h\\ f" || 
We now need only show that 
* 4 St +h) — $0 
[ b hilf’| d 


Let e,(t) be defined for each h # 0 such that e,(t) = 
elsewhere. Then 


t 


IA 


Il 
lA 
A 


<= jh!|, and zero 


aS th) —f@) _ 41 [* e+) 
b hip’ =o; | pi 


I et) 2 Ut 2) 
Jo if li 


[ e,(t) dt 


> 


II 


l . I(t + x) 
= AO l 
‘ h | ; ig ' 
by Jensen’s inequality.” 
Consequently 
| e| Mth Mla | | en(t) dt | oft aes 
0 hi\f | h Jo Jo \f | 


since 


= | 


Poe a ) P 2s tw 
| o|fet 9 4, . | a FX 4 
J | f’ | Jo if 
beeause of the periodicity of f(r). 
Let Q(7) be a funetion not identically zero which satisfies the following 
conditions, 


(i) Q(7) = O and, at least for x greater than some 2), decreases monotonically 
to zero as 7” &%, 


- *“ Q(x) 
(11) | Ma dx exists 
70 wr 


© (Z), $4.14 
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We are now prepared to prove 
TueoreM 1. If the function f(x) can be approximated, for each n = 1, by a 
trigonometrical polynomial T(x), of degree n at most, such that 
Q(n) 
WP 2 
lf T,.\\e S nr’ 
where r is a positive integer or zero, then f(x) is equivalent to an absolutely continuous 
function having a derivative, f(x), of order r, for which 


a/lé "2 € 
(1) wold; f') < A ls / Q(x) dx + | Q(z) az, 
a 1/6 r 


where a and A are constants which may depend on f(x) but not on 6. 

In the case r = 0, we understand f(x) = f(z). Two functions in this space 
are equivalent if they differ on at most a set of measure zero. The statement 
that f(z) has a derivative f(r) means that the derivatives f(z), f(a), 
f° (x), «++ f(z) exist on (0, 29), the last almost everywhere, and that f(z), 


f(x), F(x), «++ f(z) are absolutely continuous. 

The method of proof is like that used by de la Vallée Poussin” for the case 
p= % 

Set 


R(x) = f(x) — T,(x), 
o(r) = Rir(x) — Rae+i(x), 


where a is an integer 2 2 such that Q(x) is non-increasing for x 2 a. Then 

Raz(z) = f(x) — Tax(x). Since T,2(x) is a trigonometrical polynomial, 7',2(x) 
. . . . oa 

and its derivatives of every order are in Lo and by Lemma B, 


Ti? (a + h) — Th’ (x) || S 2h h{-|| TET II, 
so that we(6; 742’) S Mé, where M is a constant independent of 6. The theorem 
will be proved if we can show that /,2(z) satisfies the required conditions. — For if 
R(z) is absolutely continuous and equivalent to R,2(x), then R(z) + 72(x) is 


absolutely continuous and equivalent to f(r); moreover, if R(x) exists and 
satisfies a relation of the type of (1), f = R™ + T{P will satisfy (1) since 


wold; f'”) < weld; R°’) + weld; Th’) 
and Ain (1) may be chosen arbitrarily large. 


nn 
lety 21. Since Ry = Do de + Rar, 
k=—2 


ntl 


., Ofa""") 


qgintbir " 


Re — >. dell S || Revs 


(P), §39. 
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~~ 
This implies that = o: converges in Le to Ra. By Lemma A, 
k=2 


k 
(2) ibe |] S 2a***|| del] S a***(|| Raw || + || Rae+s ||) S 4a je 
and consequently 
x 2 a k k kl 2 aa 
(3) = ell < 4a" > Q(a )(a* i < da Q(z) ee 
k=? a—1 ime e a—1l Ja y 


eo 
ome.s. « - Ww ’ . * . . * = 
This implies” that > er converges in Lg to a function in Le, say p(x). Set 


k=2 


(4) R(x) = [ p(t) + R2(2), 
where the point # is such that b o(%) — R.2(¥); this is possible since there 
k=2 


m 


must be a subsequence of {> on} which converges to R,: almost everywhere. 


lem?) 
We have,” since 
(5) Do —p' dt Ss | Yd —p'| 0, 
Jo k=2 k=2 
(6) bi (t) dt — [ p'(t)dt = R(x) — R,2(2). 
k=2 z z 


Thus from the equation 


(7) Yo) => | gilt)dt + D> ox(2), 
k=2 z k=2 


k=? 


we have, by letting n — «, Ra:(xr) = R(x) almost everywhere. 
Consider r = 2. By Lemma <A, corresponding to (3), 


cs) oo 3 x 
2a) Q(z) 
1, (2) 4 , k+l) J? (2a Ax e . 
Dd || 9: | Ss 20a Px || S a—o—i dr< 2, 
kom? ear) a-—-lj, 2 


2 
—— ‘ 9 : * ‘ ‘ . . (2) 
this implies that > converges in Lg toa function in Le, say p~ (x). Pro- 
ken? 


ceeding as in the case of p(x), we set 


(8) v’ (x) | p(t) dt + p'(2), 
where ¥, }m,!} are so chosen that yx oi (F) » p'(#). Then, by exactly the 
k=? 


28. Banach, Théorie des Opérations Linéaires, Warsaw, 1932, p. 37 
"EC. Titehmarsh, The Theory of Functions, Oxford, 1982, $12.58 
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argument used for (4), we have R’(r) = p’(x) almost everywhere. Thus 
ao 

, . * . . 
2 o converges in Le to R’ and we can denote the derivative of R by R’ 
k=2 
instead of p’. 

For r 2 1 we have shown the existence of an absolutely continuous function 
R(x) equivalent to R,2(r) and having a derivative R’(x) which, for r = 2, is in 
turn absolutely continuous. 

The existence of R™ now follows by induction. Let s < r, and suppose 

l . ° ° ° . ° ( 
R°*” exists, is absolutely continuous, and has as its derivative p(x) defined as 
x 
belie) oy * . s) wea.s ° ° 
the limit in Lg of 2. ¢.’. This hypothesis implies the existence of an 
k=2 
absolutely continuous function, R” (x), equivalent to p”’(z) which has as its 
a 

‘ . : , e+ . Pe ae.” gy * #1 
derivative the function p“*’ (zr) defined as the limit in L¢ of » o.*?(z). 

k=2 


The proof ix as follows. By Lemma A 


e>l (* 


9,71 
(9) od < 2a od; 


‘ = wy 11 Pe |), 


lA 


and, corresponding to (3), we have 


x“ . x ( k ‘ a+2 2 /, 


of ak’? a—l) a <= ] ar" 
Zz 
ra ° : y+] . - . . . * (atl) 
Phis implies that > o (x) converges in Le toa function in La, say p* (2). 


Corresponding to (4) and (8) we set 


l] R(x) = / Pg , (t1)dt + p (2), 

where Z, |m,{ are so chosen that ys % (f) — p (ZF). By the argument used 
i= 

neases s = O ands = | with equations corresponding to (5), (6), (7) we have 

R(x) p(x) almost everywhere. 


(0) 


When « / 1, we denote p“"" (x) by R(x). For r = 0, R''(r) R.2(2), 
Vi rad) We must show that 2” (zr) satisfies (1). Sinee, for r = 0, » oy 
k=? 


» I” in Ls we have from Lemma Band (9), 


Rae + h R'(z)\i s > gd, (24 +h) (x) 


+ YK gbi'(e +h) — of") II 
mel 
S 2h) Dior’ | + 2D Iles 
h~-2 mit 
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a-—-l- 2— 1 r 


< 2 hla’? > a 200") + 2770" > a(a*) 
k=? m+1 
2(2a)'"** isd 2(2a)"*' [* Q(x) 
< 220) Ih | Or) dx + 22% I M7) ae 
1 tf a” 


. 


7 4 ° sa: m—1 el as m 
If m is now chosen subject to the conditions @” ~~ S 6 < a" and A 2 
+2 1 
2(2a)’**(a — 1)", we have 


ajé Px Q ) 
wd; RR’) < | 6 | Qir) dx + | _— iz], 
a 13 oi 


. 


If the Q(z) considered in Theorem 1 satisfies only the hypothesis (i), namely, 
that Q(r) | 0, and not (ii) also, then the condition f — T, , S Q(n)n™ 
does not necessarily imply that (1) holds. In fact f‘’(r) may not even €L3. 
For consider the space Le and let 


x 


fix) = i QA) + sin ke. 


k=l 


Then, if s, =s,(2;f) is the n-th partial sum of the Fourier series of f(x), 


2 t 23 2 ; 
Tey > — < axn) ( 7 7 < Qn) 
A k=n+l ki! a. tent Kk F rn 


But f(x) = e O(k)k eos kr may not even be in Le since 
kewl 


S'\ls = | call 


does not necessarily converge; for example, if Q¢r) = [log (re + 1) 

We remark that the theorem just proved holds for L, as well as for Le (and 
thus for L,, p > 1). The proof for 1, is the above proof with the L$ norm re- 
placed by the 1, norm since the inequalities of Lemmas A and B hold also 
for Ly. 

As an immediate corollary of Theorem | we have the positive assertion of the 
following theorem. 

Turonem 2. If the function f(x) can be approximated for each n = 1, by a 
trigonometrical polynomial, t,(x), of degree n at most, such that f t 
O(n “), p = 1, then 


(1) uf 0 < a < 1, f(x) € Lipa, Pp); 


‘ l 
(u) ifia 1, w,(6; f) (3 log ') 
5 


Moreover there exist functions for which ff — bt, O(n “) which do not belong 


lo Lap(l, p) 
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. * on 
In the previous theorem we choose r = 0, Lg = L,, Q(7) = Ma“, where 
M is a positive constant. Then 


a/lé oI \ 
wp»(6;f) = ofs [ x“dx+ [ x! “ dz}. 
a 1/8 


O(5*),0 <a <1; 


o( log ), a=t1. 


If f(z) « Lip(a, p), a = 1, p > 1, then f(z) is equivalent’ to the indefinite 


This gives 
(i) w,(6;f) 
(ii) w,(6; f) 


Il 


integral of a function in L,,. Consider the function f(z) = )> net. We have 
n=1 


~ a 
If — salle = ( > r*) m2 nn, 
v ~1 


y= n+ 


But f(z) is not in Lip(1, 2) since 


and f’(z) is not in Le. Indeed, for h > 0, 


~*~ ; wo - 2} 
\| fla +h) — fle — h) lle = (= a ain nh) . (X a[siar*]) 
1 


n= 


so that 


j 
iI fle +h) — fle — h) |b ¥ 0( a tog | ) 


If a > 1, we may take Q(7) = a2 “, where r is an integer such that 0 < 
a—r<il. Then f(x) exists and the conclusion of the theorem holds with 
f(z) in place of f(z). 

We now turn to the consideration of theorems of the converse type. 

THrorem 3. If f(x) « L$ possesses a derivative of order r, say f(x), in Le, 
where r is a positive integer or zero, then, for any positive integer n, f(z) may be 
approximated in Ly by a trigonometrical polynomial t,(x), of order n at most, 
such that 


:; ¢ 1 r 
If — talie = o(» w(tss )). 


14 (Z), §4.7, (8). By the use of lacunary series and the inequalities of (Z), §§9.601 and 
4.602, we need not restrict the example to the space Ll» . 
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To prove this theorem we use the method and notation of de la Vallée Pous- 


, . 2t 
A= BH and ¢(t) = > ass (z re >). 


. 15 y 
sin. We set 
k=0 


the A + 1 constants a, ,k = 0,1, 2,--+ , being so determined that ¢(0) = f(z), 
¢(0) = 0, s = 1, 2,---.\. The trigonometrical polynomial ¢,(z) is defined 


by the equation 
1 sin t\/sin t\?** 
s = ed lt, 
ie) = 5) [. (5) ) . 
© /sin t\?** 
r(A + 2) = i dt. 


The order n of t,(x) is (A + 2)2*m — 1. Since o(0) = f(x), we write 


t,(x) — f(x) = . [ F t)\(sin nae 
i — io 5. m t : 


where F(t) = o(t) + o(—1t) — 2¢(0). 
With these definitions 


2 l - 2t 
¢"() = p> a5 nf (« + x) 


PF? = 6° + 6°°(—d — 26°"), + even; 
= ¢°(0) — ¢(—d), r odd. 


where 


and 


Consequently, 


° bs | ‘ 9 ' 
FPO I Ss DL sear f(a + =) +52 - =) — 2f(2)\, reven, 
| 
. r 2t r) (r) (r) 2t 
< > stl. fe‘ (= + =) _— f' (a) +f (x) —f (« a =) _ ¥ odd, 


or, finally, 
F(t) |) = O(we(2t; f°”)). 


We also have, for r = 


A 
F(t) = [ oe [ 7 F (uw) dudt, g eee dts dt, 


© (P), pp. 47-50. 
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so that 


\| Fo | 


IIA 


t ty "tr—i : 
| — | \| F°" (wu) iI dudt,_, «++ dtedt, 
t ty "tpi 
o| | [ tee | we(2u; f°”) dudt,1 aes dts |. 
0 0 JY 


t ty (Ptr—y 
of 2 [ vee | wa( 2 f°) dutty vee its | 
m” Jo 0 0 m 


Since f(x) is periodic, 


2u (r) 
Woe| — ; = 
(® J 
Thus we have 


( t ty tr-1 
r(£) - ot w(? f°) [ vee (Qu + 1)dudt,_, --- itd 
m m m 0 0 0 
= O.m ‘tw 1. )) 
oe “*\ im vd ) 


l hae sin t\** 
It. -fll s r(“ at 
T(X + 2) J, m t 
( +m . a+) 
OLm ‘ws( :s*) (* ) dt} 
m Je t } 
( | ; 
Ovm wo( f°) 
m i 


- , 1 
Since m = (n + 1)(4 + 2) 2", we have 


( . | - 
i_- f =O<n we + (- 
n ) 


‘THronem 4. Jf f(x) € Lip(a, p), p 2 1,0 < a@ S I, then, for any posttwe 
integer n, f(x) may be approximated in L,, by a trigonometrical polynomial, t,(x), 


ll 


Thus 


= 
, 
s\~ 
ee 
I 


A 
w 
+ 
£ 
Ps 

Pay 
ea 


Now 


Il 


il 


of order n such that 
 j Ge tle On “) 


We put 7 0,4 = 0, andw,(n';f'’) Mn * in Theorem 3. For p > 1, 
the result is obtained by taking L$ tobe L,. For p = 1, the result is obtained 
by carrying out the proof with 1, in place of is. 

Theorems 2 and 4 give the proof and range of Hardy and Littlewoods theorem 
The question concerning the remark following the theorem is answered by 


Theorem 5 
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In the following s, = s,(f) = s,(z; f) denotes the n-th partial sum of the 
Fourier series of f(x) and ¢, = ¢,(x;f) denotes the n-th (C, 1) mean of s, . 

Lemma C. If f(x) € L, and t,(x) is an arbitrary trigonometric polynomial of 
degree n = 1 at most, then 

i) fp>1iif—s |, Alif—tili,, 

Gi) fp=1, f—s,. 1S A + logn) | f—tii, 
where A is independent of f(x) and n. 

We may write 


f-sli=||f—-t+4.—8 || Sif —t.|| + lle. — tl. 


The tngonometric polynomial s,(27; f) — t(z) = s.(a; f — t.). Hence for 


p > 1. by an inequality of M. Riesz,"° we have 
es -—t.||SAN!f—tli, p>. 
When p = 1, we have 


lls, —¢,|| = ; i I [f(a + uw) — t.(2 + uw) D,(u) du’ dz, 
wT fv 0 


where D,(u) is the Dirichlet kernel. Interchanging the order of integration, 


we have 


Wf — te if | D,(u) | du S ACL + log n) || f — ta] 
0 


Tr 


| 
IA 


: “i 
since 


| D,(u) | du? . log n. 
0 rT 


T 


‘Turorem 5. /f f(x) € Lip(a, p), 0 < a S 1, then 
(i) when p > 1, f — sa \\p = O(n“); 
(ii) when pp = 1, ff — s 4 = O(n “ log n). 
For p > 1, a < 1, O-large cannot be replaced by o-small and there exist functions 


° ° ' l 
m Lip(l, |) for which | f — s, |) = o(n © log n). 
The positive assertion of this theorem is a corollary of Theorem 4 by applica- 


tion of Lemma C. 
To show that O-large cannot be replaced by o-small for p > Lt, a < 1 consider 


the funetion 


a om 
° COs 36 TI 
d= 2D 
Jka ~ ama? O<a<l 
 (Z), §7.3, (1). Sinee js, {| S isk i] + ii, p > 1, we have |] s, || s (Ap + DUS] 


(Z), §8.3. 
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This function” is in Lip(a) for each @ and, a fortiori, in Lip(a, p) for each « 
and all p. We have,” however, 


lf — sm || = B( Zz ¢:) # o(2°"*). 
ven+l1 o” 
The function 
re — sin yr 
f(x) = > : 


is in the class Lip(1, 1). Making a slight change in notation, we set 


re cc ? 4 
t,.(x) = tn(x;f) = hn f f(x + t) (= in’) dt, 


m sin $t 


ve we sin 2mt e 
hn  j-«\msin 3t ; 


Our definition” of t,(x) differs slightly from that of Theorem 3. 
For 


where n = 2m — 2 and 


f(x) = > Sih A 


v=l v 


' sin 3ma \* 
t,.(z;f) = rhal : ) — }. 


m sin $x 


we have 


To show this we write 


" S sin vf sin 3 4 
t.(z;f) = hk, :% in ve + "( ah di 
~~ v m sin $t 
“sinve [* sin }mt \‘ 
= hem : COS vt 3 dt. 
v~l v g m sin ht 


" r P 1 4 
ti.(x;f) = he = COS Vr cos vl (= dt. 
vl : m sin 4t 


Then 


* (Z), §2.9, (3) 


19 (Z), §9.602, (1) 
26 We are here using the notation of Dunham Jackson, ()), page 3, rather than that of 
de la Vallée Poussin. However 


os , f sin gmt \* 
he f han flr +1) f(r)} ; dt 
- m sin Mt : 
“ sin 4mt \* ! 
= OL m t) . dt 0 
. m sin 4 n 
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This equation implies 





’ sin }mz\' 
t,(a;f) = thn (3 a ) — }. 
m sin $x 
Now 
nm 
, 
s(x; f) = Lo cos vz = Dalz) — 3, 
v=l 


where D,(x) is the Dirichlet kernel. 
By an inequality of F. Riesz,” 


1 , , l i sin 3 ‘ 
—|[t. — 8, || = - | D,(u) — rh», (is) du 
n nj « m sin 4u 


; a ; rh, [* (sin 3mu\ logn 
- | D,(u) | du — — | ———~iae- | Gu w A -_=— =, 
% Jos n J = \msin fu n n 


rm™M.: : 1 3° 
Phis gives | t, — s, | # o(n” log n). Since 


IV 


I|tn — sn || 


IV 


WS —tall eis — 8. || Sit — and f—t.|| = On), 
we have || f — s, |, # o(n log n). 

TueoreM 6. Jf f(x) € Lip(a, p), 0 < @ S 1, then 

(i) if p> ] or ifp= lLla< 1, | f- oc, || = O(n *); 


n 
For the case (i), O-large cannot be replaced by o-small. 
For the cases™ p 2 1,a@ <1, and p = a = 1 we write 


re EL I [fa + 0 — f(a) K.(t) dt | dz, 
wT Jo 0 


where K,,(t) is the Fejér kernel. The result follows by Minkowski’s inequality” 
since @ 


[ “K,(O0dt = O(n *), a<l; 


= O(n 'logn), a= 1. 


When p > 1, f(x) « Lip, p) is equivalent to the indefinite integral of a function 
in Lb Since 


lon — sell = —-, WIS A IS I = OY, 


2 (Z), $7.31, (b) 

2 The case p > Land « < 1 was obtained by O. Szdsz, Uber die Fourierschen Rethen 
gewisser Funktionenklassen, Mathematische Annalen, vol. 100 (1928), pp. 5380-536 

"9 (%), $4.18, (4) 
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ail P . ° . ‘ 
where &,(z) is the n-th partial sum of the conjugate derived series of f(x), the 
result for p > 1, a = 1 follows from Theorem 5 and the inequality 


lf-—o,.||} Ss \|\f—s,. || + Sn — On 


To show that O-large cannot be replaced by o-small in (i) we apply Lemma C 
to Theorem 5 for the case p > 1, a < 1. For p > 1, a = 1 consider f(x) = 
cos x. This function belongs to Lip(1, p) for every p. But 


f-oall = 4-"T14) = hist «o(2), 
n n n 

Corresponding to the case“ p = ~, if f(x) can be approximated in L,, p = 1 
by s,(r; f) such that the order of the approximation is w(n), it follows from 
Lemma C that no approximation of f(z) by a trigonometrical polynomial! of 
order n can, for p > 1, be o[w(n)], and, for p = 1, o[w(n)(1 + log n)“}. 

Also corresponding to the case” p = x we have 

THEOREM 7. A necessary and sufficient condition that a function f(x) periodic 
in 2x belong to the class Lip(a, p), 0 < a < 1, p 2 1, ts that o,(x; f) belong 
to Lip(a, p) uniformly in n. 

The necessity follows immediately from the inequality || o, |, Sf) and 
the sufficiency from the same inequality on the application of the Fatou lemma. 

THeoreM 8. Let F(x) be periodic in 2x and the indefinite integral of a function 
S(r)eL,,p 21. Then 


aoa ae | 
(i) \| F s,(F) | > n+1 if Snlf) ||, p>; 


A 


n 


(ii) \F ne sn(F) | < a(' + log "iis = s,(f) (|, p = 1. 


Let 
x 
fiz) = be a, cos vr + b, sin vx ° 
y= 


(in order that F(z) be periodic aj = 0) and let f(z) denote the conjugate function. 
Choose m > k > n. We may write 


k 


s.(z; F) — s,(2; F) = # ; (a, sin vr — b, cos vx) 


n+l 


1 “ a 
= a+ j [sm(as DD oni sn(x;f)] 


k 


1 
‘ 1 . j 
2 wy + py fam(a J) — sa; f)] — j ism I) — s(x; f)). 


“(P), p. 22. 
26 (Z), §4.719. 
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Then 


l Ma. —— ; I 
n+1 1 su(f) sn(f) i 


+s 


nyt vy ; 1) 


First let m— » and then k— ». We obtain,” for p > 1, 


i llF- wD +2 p lie - 


n+l 


\]s(F’) — s,(F) || 


I) 


I sm(P) — 8.(P) || + Zllm(F) — sa(P) I 


| F — s,(F) || 


lA 


n+ v(v 
1, | — 2 
is - eas s(f) || 


™ 


und (i) follows since, for y > n, || f — s(f) || S Mf — sf). 
The above proof is due to the referee; it eliminates the log from the first 
part of the theorem. For p = 1 


wo 


1 ; 
D —(a, sin vx — b, cos ve) 


v=l V 


= [ {f(x + t) —s,(r + roid me) ae 


The interchange of summation and integration is possible since f(r) « L and 


> se . y 
> »' sin vx is boundedly convergent. We have 


n+l 
|| F — s,(F) || < Lf" ([" aca i.) |e) > os Q 


*\if- s.(f) || a | en at 


F(2) — s,(2; F) 


II 


lA 


IIA 


By Theorem 5 


[ > sin a dt < a(! tp log ") 


si ntl n 
and the theorem follows. 
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A NOTE ON NON-ASSOCIATIVE ALGEBRAS 


By N. JAcoBson 


It is the purpose of this note to obtain relations between an arbitrary algebra 
MN (not necessarily associative) and an algebra Y% (necessarily associative) of 
linear transformations determined by W%. If % is simple, the centrum € of % 
ix an algebraic field and 9 may be regarded as an algebra over ©. When this 
ix done 3 becomes a normal simple algebra, i.c., remains simple when this field 
is extended to its algebraic closure. A field having this property for algebras 
of characteristic 0 has been defined previously but less directly by Landherr.' 
Some of our results have been announced for Lie algebras of characteristic 0 
by Albert 


1. Let 2 be an arbitrary algebra (not necessarily associative or commutative) 
with a finite basis over a commutative field &; 9 is a finite dimensional vector 
space over ® in which there is defined a composition zy of pairs of elements 
zr, y such that 


(1) (x + y)z = rz + yz, 2(x + y) = 2x + zy, 
(2) (ry)a = r(ya) = (ra)y, aed, 


The mapping z > za = rA, of R on itself will be called the right multiplication 
determined by a. Equations (1) and (2) show that A, is a linear transformation 
in the veetor space MN. Similarly we define the left multiplication determined 
byaasg—-ar = 2A,. Let Abe the enveloping algebra of the left and right 
multiplications of MN, ie., the smallest algebra of linear transformations in ® 
containing all the left and right multiplications. The elements of % are sums 


of terms of the type Ay, +++ Ag, @a = 7 or lL) where Ajj, is a multiplication 
determined by a;. We shall therefore denote an arbitrary clement of % by 
YA,y,, «++ Ay, (not summed on 7,!). Thus % may also be defined as the smallest 


ring of linear transformations containing all the multiplications. 

If a, , +++, a, is a basis for Rover & and A is a linear transformation in this 
veetor space, then A is completely determined by the matrix (a,;) such that 
a,A = XYa,a,,. The correspondence between A and the matrix (a@;;) determines, 

-is well known, a reciprocal isomorphism between the ring of all linear trans- 
formations in R over ® and the matrix ring ®, of all n-rowed square matrices 


Kteceived February 11, 1937; presented to the American Mathematical Society, April 9, 
1937. The author is a National Research Fellow 

'W. Landherr, Ober ernfache Licache Ringe, Hamb. Abhandlungen, vol. 11 (1935), pp 
41-4 

* Bull Am. Math Soc., vol. 41 (1935), p. 344 
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with codrdinates in ®. In particular Y% may be represented (reciprocally) as a 
subring of ®,, . 

If ¥ is an extension of the field &, we define Rz to be the algebra over Y having 
the same basis as ® has over , iec., if R = ah +--+ + a,%, then Rze = 
a> +--+ + a,2. Any linear transformation A in R over ® has a unique 
extension to a linear transformation in R: over ©. The matrix of A (relative 
to the basis a;,--+,@,) and of its extension are, of course, identical. It 
follows readily from the matrix representation that the enveloping algebra of 
the multiplications of Rs is the extension algebra Wy . 

As usual we define a (two-sided) ideal S of ® as a subspace of S such that 
S D zx, rz for allze@ and eR. Thus © is a subspace invariant under all 
the left and right multiplications and hence under all the transformations of Y%. 
M is a direct sum of the ideals Ni, --- , Re (M = Wi @--- @ R,) if every z 
in Mis expressible uniquely as 7; + --- + 2, 2;in ®;. This notion coincides 
with that of decomposability of ® relative to the system YW. Since x;2; € the 
intersection R; A R; = 0, zz; = O for any zie Ri, 2; E Rj. Ris simple if it 
has no proper ideal, or in other words, if % is an irreducible system of linear 
transformations. 

THeoREM 1. A necessary and sufficient condition that R be a direct sum of 
simple algebras is that MU be a completely reducible system. 

If R = Ri @ --- @ R., where the MR; are simple, then the R; are irreducible 
subspaces and % is completely reducible. Conversely if the 8; are irreducible, 
they are simple. For let S; be an ideal relative to Rj, ie., 2:24, riz; € S; for 
all zr, e Ms, 2€S;. Since z;z; = 232; = O for 7; eR; (fj + 1), we have rz,, 
z,x ¢ S;, and so S; is an ideal of R and hence an invariant subspace relative 
to %. This contradicts the irreducibility of R; . 

We recall that an algebra % of linear transformations in a vector space ® 
is completely reducible if it is semi-simple. Suppose conversely that YI is com- 
pletely reducible, say, R = Mi ® --- ® Re, where the R; are irreducible 
invariant subspaces, and let 2% be a nilpotent ideal of WM. If B is any subal- 
gebra of M, and S a subspace of R, we denote the subspace of elements LyB, 
yeS, BeBby SB. Since (SB)A = S(BM), SB is invariant if B is a right 
ideal. In particular RM is invariant, and since the MW, are irreducible, either 
RN = Oor RN = Ry. But RN = MR, implies Ry = RW = 0 if p is suffi- 
ciently high. Thus R,R = O and RN = 0, ie., NR = O and so Y is semi-simple. 
By Theorem 1 we have therefore 

THEOREM 2. A necessary and sufficient condition that W be a direct sum of 
simple algebras is that A be semi-simple. 


* For definitions of irreducibility, direct sum, complete reducibility, equivalence (operator- 
isomorphism) for systems of linear transformations, see van der Waerden’s Moderne 
Algebra, vol. II, 1931, §108. We shall also require a number of results on the structure 
and representation of semi-simple algebras. These may be found in §§115, 116, 118, 119, 
121 of van der Waerden's book. 
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An element z is an absolute zero-divisor if z # 0 and zx = 0 = zz for all 

z in ®. Consider Rs: where 2 is an extension of & and suppose that 2’ = 

ati +++: + a,¢) is an absolute zero-divisor. If aja; = Laxyeij (y €®), then 

za; = a,z’ = 0 implies that bx vests = 0 and 2 vets = 0. Since these linear 
, 2 

homogeneous equations have a non-trivial solution ¢; , «++ , ¢, in 3, they also 

have one, say, 61, °°*,¢, in ®, and so LYa;¢; is an absolute zero-divisor in ®. 


Thus ®: has absolute zero-divisors if and only if R has. We note also that 
a simple algebra ® has no absolute zero-divisors unless R = z& where z’ = 0. 
We suppose from now on that ® has no absolute zero-divisors. With this 
restriction we have 

THeorEeM 3. Jf R = Mi @--- S@ NM and the R; are simple, then A = 


AU, B--- B A, and the A; are simple, and conversely. A; is the enveloping 
algebra of the left and right multiplications of R; (acting in ®). 
Let RW = RN, GS --- BM, and A; be the enveloping algebra of the multiplica- 


tions of R;. The elements of %; map R; (7 ¥ 7) on 0 and R on a subspace # 0 
of ®,. It follows directly that % = % @--- @ WM. A; # O since the ele- 
ments of ®, are not absolute zero-divisors. Since the transformations of % 
map KR, on 0, the algebra YW; is isomorphic to the enveloping algebra of the 
multiplications of R; acting in R;. The latter is simple since it is an irreducible 
system of linear transformations and hence %; is simple also. Conversely if 
4 = % @--- @ A, where the A; are simple, YW is completely reducible and 
hence R = RN @B--- GS My, where the ®; are simple algebras. By the first 
part and the uniqueness of the decomposition of an algebra as a direct sum of 
simple algebras we conclude that k = k’ and Y; is the enveloping algebra of 
the multiplications of R; . 
CoroLiary. SR is simple if and only if Y is. 


2. Let © denote the centrum of A. If MR is itself associative and has an 
identity, © coincides with the multiplications determined by the elements of 
the centrum Co of WR. For ifee CC, = Cre ©, and if Ce Cand IC = ec, then 
2C (In)C = (IC)r = er = (r1)C = c(IC) = xe, so that ce C’ and C = 
C, = C,. If Ris associative but has no identity, we may adjoin an identity 
to it and repeat the argument. We then obtain the fact that € is the algebra 
of Jinear transformations determined by the multiplications of ©’ plus the 
identity mapping. When § is arbitrary we shall call © the extended centrum 
of R 

If Ris simple, so is MW and hence © = P is an algebraic field of finite order 
over ® Tf eP, 


(sy)E = (xt)y = x(yé), 


ind so Romay be regarded as an algebra over P. 
An algebra ® over & will be called normal simple if Rg , the algebra obtained 


by extending ® to its algebraic closure Q, is simple. 








— 
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TuHeoreM 4. SX is normal simple if and only if it is simple and its extended 
centrum consists of the multiples of the identity transformation. 

By hypothesis the centrum of the simple algebra & consists of the &-multiples 
of 1. It is a well-known result that %o is simple in this case. But Wp is the 
enveloping algebra of the multiplications of Ro , and hence by the corollary to 
Theorem 3 the latter is simple. On the other hand, if € is larger than 4, 
Ao is not simple* and hence R is not normal simple. 

Thus if ® is an arbitrary simple algebra, it becomes normal simple when 
regarded as an algebra over its extended centrum. ‘ 

TuroreM 5. If ® is simple and has order n over its extended centrum P, then 
4% => P,, , the algebra of n-rowed square matrices with coefficients in P. 

We regard P as the underlying field. Since Ro is simple, Wf is an absolutely 
irreducible system of linear transformations. It follows from Burnside’s 
theorem that 9% contains n’ linearly independent linear transformations and 
henee is isomorphic to P,, . 

More generally the structure of % when ® is a direct sum of simple algebras 
may be deduced from Theorems 3 and 5. 


3. Now suppose that 9 is an associative algebra with an identity. It is well 
known that the right multiplications form an algebra ®, isomorphic to R and 
the left multiplications form an algebra ®, reciprocally isomorphic to ®. 
%, (M,) is the totality of linear transformations in the vector space R commuta- 
tive with those of R, (Rp). Thus ®, 1 R, = CE. 

If R is normal simple, R, A R, = 1. If the order of NR over &, (R:b) = n 
by Theorem 5, (%:@) = n? = (M,:4)(R:@). Thus A is a direct product of 
W, and W, and so we have obtained an elementary proof of the following theorem 
due to Brauer: 

THeoreM 6. The direct product of a normal simple algebra and its reciprocal 
algebra is a complete matric algebra. 


1. We return to the general case in which ¥ is not necessarily associative and 
S-; . . 
suppose that 2 — 2° is an automorphism of ® over ®, ie., 


(c+y)*=2°+y’, (ra)* = rx*a, (ry)* = x°y’*, 
anda wis (l-l). If P= pa -++ Ags (a = ror 1) is an element of &, 
we define P* » Mii t.. where A* is the right or left multiplication 


determined by a’. P* is independent of the representation of P. For if 
De Any + As, = 2 Buy +++ Beis Gis = 1, D, ie. 


a(>> Au, +++ Aa) = 2D By, «+> Bu) 


‘If © is separable, Yo is semi-simple though not simple, and if € is inseparable, Ya has 
a radical. Cf. van der Waerden, loc. ecit., §119. 
°R. Brauer, Ober Systeme hyperkomplexer Zahlen, Math. Zeits., vol. 30 (1929), p. 108 
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for all x, then 

(D0 Aly +++ AN) = 2°(L Bi +++ Bis) 
forall.c*. Since 2* ranges over all of R when z does, we have Zz Aik, 7+ Al, = 
> Bij, +++ Bi;,. It follows that the correspondence P — P* is an auto- 
morphism of % and (2P)* = 2° P*. 

Any automorphism of an associative algebra induces an automorphism in 
its centrum. Hence if ® is simple, an automorphism of ® defines an auto- 
morphism in the field € = P, the extended centrum. 

THEOREM 7. If Ris normal simple over P and P D & such that (P:®) is finite, 
then the automorphisms of R over ® have the property (xt)* = x*t*, where & €P 
and & — &° is an automorphism of P. 

Let @ be the group of automorphisms of 8 over ® and X the subgroup con- 
sisting of the automorphisms of R over P. Theorem 7 shows that X is an 
invariant subgroup of @ and that @/X is isomorphic to a subgroup of the Galois 
group g of P over ®. 

Now suppose that R = My X P (= Mop regarded as an algebra over &) where 
My is a normal simple algebra over ®. If a, +--+ , a, is a basis for RM) over & 
or for R over P and S is an element of g, then the correspondence z = pi a;t;— 
> af = 2* is an automorphism of ® such that (xé)*! = 2"! = 2°. Let 
(%, denote the subgroup of & consisting of the elements S,. Evidently @, > q 
and @, M ¥ = I the identity mapping. By Theorem 7 any element of & has 
the form S,H where S; ¢ @, and H «X. Hence @/¥ = G, = g. This result 
may be used, as we shall show in another paper, to determine the automorphisms 
of simple Lie algebras and simple continuous groups. 


University or Cuicaco 














THE INVERSION PROBLEM OF MOBIUS 
By Erar HILie 


1. Introduction. ‘The present paper represents an attempt to give a rigorous 
treatment of certain inversion problems which have their origin in a little-known 
paper by A. F. Mébius.’ 

As a typical, though not the oldest, example of these inversion problems we 
might take the linear functional equation with constant coefficients 


oe 


(1.1) > Os an f(nz) 


n=l 


g(z), 


a formal solution of which has the form 


(1.2) > b,g(nz) = f(z). 
These problems all lead to the same infinite system of bilinear equations 
(1.3) ab, = 1, DY aabya = 0, n>, 
din 


for which the algorithm of Mébius seems a fitting name. 
This algorithm is perhaps best known from the problem of finding the recip- 
rocal of an ordinary Dirichlet series, i.e., a solution of the problem 


2 2 
(1.4) > aa? 2 ke” @ 8. 

n=l n=l 
We shall see that the properties of these series are fundamental in all these 
inversion problems. 

This observation suggests that there is a class of inversion problems associated 
with the problem of expressing the reciprocal of a general Dirichlet series or, 
still more generally, of a Laplace-Stieltjes integral as a function of the same class. 
In general the reciprocal is not so expressible, but whenever it is, certain fune- 
tional equations of the type 


(1.5) / f(uz) dA(u) = g(z) 
1 

have solutions of the form 

(1.6) q(uz) dB(u) = f(z), 
1 


Received April 24, 1937; presented to the American Mathematical Society, March 26, 1937. 
' Ueber eine besondere Art von Umkehrung der Reihen, Journal f. Math., vol. 9 (18382), 
pp. 105-123; Gesammelte Werke, vol. TV, 1887, pp. 589-612. 
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where 


(1.7) / A(“) dB) =1, l<u. 
1 , 


The last equation is the transcendental analogue of the algorithm of Mébius. 

In §2 of the present paper there is a discussion of the original problem of 
Mobius, of the algorithm of Mébius and of the problem of finding the reciprocal 
of an ordinary Dirichlet series. While there is comparatively little that is 
strictly new in this paragraph, the results are necessary for the rest of the paper 
and do not appear to be well known. In §3 we discuss equation (1.1) and 
various connected problems. In §4 we discuss equation (1.5) and the problem 
of expressing the reciprocal of a Laplace-Stieltjes integral as an integral of the 
same kind. 


2. Some classical problems. 

2.1. The algorithm of Mobius. Let A = {a,} be a given infinite sequence of 
real or complex numbers. The sequence is proper or improper according as 
a, ~ 0 or = 0, and in the former case it is normalized if a, = 1. Following 
O. Holder,’ we call 8 = {b,} the reciprocal sequence of X if the latter is proper 
and % and % satisfy the algorithm of Mébius 
(2.1.1) a,b, = 1, Dd aabye = 0, n>, 

din 
or symbolically AB = 1. The underlying product definition is that of Dirichlet 
multiplication, i.e., in general AB = C, where the sequence € = {c,} is defined by 
(2.1.2) Cc. = 5 Gabyja . 


din 


Thus, formally, 


bo} x ~~ 
>. 4.8) 68 = > e.8”. 
n=l 


n=1 n=l 
In case of the reciprocal sequence, € is simply the unit sequence 1, 0, 0,--- . 
The system (2.1.1) determines 8 uniquely. We have 
(2.1.3) be = Do (-1) "4" Carey. Gn) @an)™ *** 


where the summation extends over all factorizations of n = dj'dy* ---, and 
Ca,a,--. is the combinatorial function which gives the number of possible 
arrangements of a set consisting of a, objects of one kind, az objects of a second, 
etc. We have 


(2.1.4) De (—1)"***"* Cares... = n(n), 


2 Uber gewisse der Mobiusschen Funktion p(n) verwandte zahlentheoretische Funktionen, 
die Dirichletsche Multiplikation und eine Verallgemeinerung der Umkehrungsformeln, 
Berichte d. Siichs. Akad. d. Wiss., Math.-phys. K1., vol. 85 (1933), pp. 1-28. 
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the Mobius’ y-function, whereas 
(2.1.5) D Cae; --. = 4(n), 


the number of factorizations of n into factors # 1 (n ¥ 1, w(1) = 1) when 
attention is paid to the order of the factors. (n) is a highly irregular function.” 
For the following it is enough to note that 


(2.1.6) >> x(n)n™* = [2 — es)" 
n=l 

for R(s) > p, f(p) = 2, and that 

(2.1.7) a(n) < Cin’ 


for all values of n, whereas for every «€ > 0 there are infinitely many values 
of n for which 


(2.1.8) a(n) > Con’. 


2.2. The reciprocation problem for ordinary Dirichlet series. That the recipro- 
cal of an ordinary Dirichlet series with a, # 0 can be represented by a series of 
the same kind is well known. The best theorem in this connection is one due 
to E. Landau.‘ 

THEOREM 2.2.1. Let 


(2.2.1) D(s;a,) = Doann™", a #0 

n=l 
have a domain of convergence, and let the function represented by the serics be 
holomorphic and different from zero fora > a. Then 


(2.2.2) [D(s;an)1* = Do ban 
n=1 
is convergent for ¢ > a. 

This theorem lies quite deep. It naturally brings up the question whether 
it is possible to assign upper bounds for the real parts of the possible zeros of 
D(s; a,) and thus also for the abscissa of convergence of the reciprocal. The 
answer is in the affirmative and is fairly trivial. 

THEOREM 2.2.2. Let {r,} be a given sequence of positive numbers, 1, = 1, 
r, = O(n") for some fixed real x. Consider the class D of all Dirichlet series 
D(s; a,) with ay = 1,\a,| = 7r,,n 22. Let S be the abscissa of convergence of 
D(s; rn) and put D(S + 0;7r,) = RS «w. If R > 2, the equation 


(2.2.3) D(o;r,) = 2 


3 See E. Hille, A problem in ‘‘factorisatio numerorum’’, Acta Arithmetica, vol. 2 (1936), 
pp. 134-144. x(n) is denoted by f(n) in this paper. 

4 Uber den Wertevorrat von ¢(s) in der Halbebene o > 1, Géttinger Nachrichten, 1933, 
pp. 81-91, p. 90. 
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has a real root p = p(2) > S. No series of D has any zeros in the half-plane 
o > p, whereas there exist series in D having infinitely many zeros in the strip 
p—e<a< p for everye > 0. If, on the other hand, 1 < R S 2, there are 
no zeros of any series in D for o > S and there are series having either zeros or 
singular points in every strip S —e <a <S. 

Proof. The verification of the fact that the series in D cannot have zeros 
in the half-planes ¢ > p(D) and o > S, respectively, is elementary and may be 
left to the reader. Further, the series [D(s; a,)]"' = D(s; b,) are easily shown 
to be absolutely convergent in the same half-planes. 

If R > 2, we note that the series 


zn 
1 —- > ran 


n=l! 


is a member of the class T and vanishes at s = p(D). It further has infinitely 
many zeros in any strip p(D) — « < o < p(D). If 1 < R S 2, the point 
s = Sis a non-polar singularity of D(s; r,) and consequently also a singularity 
of [D(s; r,)]"’. It follows that in either case the estimates given are the best 
possible valid for the whole class D. 

It would be of some interest to know if these estimates for the upper bound 
of the real parts of the zeros are imposed upon us by a relatively small set of 
elements in 2 or if they represent the rule rather than the exception. A dis- 
cussion of this question in general calls for an interpretation of D as a topological 
space, i.e., a definition of closure, possibly based upon a definition of distance 
or of measure. 

But there is one very special case in which a complete answer is available 
without any topology. Suppose that a, = 0 unless n is a prime, and put 


ay, = a, Toe = Pe, D(s;a,) = P(s; ax), D = §<. 


Jet us suppose that Rk > 2. Using a classical theorem of H. Bohr on the rela- 
tion between the set of values of a Dirichlet series and of the associated power 
series in infinitely many unknowns,’ we conclude that every series P(s; ax) has 
infinitely many zeros in every strip p(B) — € < o < p(B). Indeed, the asso- 
elated power series is siraply the linear form 


> — 1] 
L(x) 1 + > ann 1 + D> me "~“. 
1 1 


and putting z; em? we get Lis) | bi pap” 0. By Bohr’s 
1 


theorem the value G is taken on infinitely often by ’(s; ey) in every strip 
pea —p Hence in this ease all the reciprocal series in B have the same 
abecises of convergence, viz., pl) 

‘ther due Bedeutung der Potenzrechen unendlich vieler Variabeln in der Theorie der 
Larichletechen Vehen Lan *, Gottinger Nachrichten, 1914, pp. 441-488, p. 451, 
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2.3. Mébius’ problem. Mobius’ raised the following question: given a power 
series 


(2.3.1) f(z) = > a,z", a, ~ 0, 


n=l 


find the expansion of z in terms of the functions f(z"), n = 1, 2,3, ---. Let it be 


(2.3.2) 2 = ES dafle’). 


n=l 


A straightforward calculation shows that the b’s must satisfy the algorithm of 
Mobius. Further, it is clear that if 


(2.3.3) F(z) = » A,2", 
then 

(2.3.4) F(z) = > B, f(z"), 
where 

(2.3.5) B, = > ba Anja. 


All this is highly formal and an analyst naturally wants to know the range of 
validity of the formulas, conditions for convergence, ete. 

As a preliminary step in this study, let us suppose that the power series in 
(2.3.1) has a circle of convergence, and form the adjoint power series 


(2.3.6) ols) = >, b.2". 


We eall g(z) the Mobius transform of f(z), 
(2.3.7) e(z) = ML S(2)]. 


The Mobius algorithm shows that conversely f(z) is the Mobius transform 


of y(2), 
(2.3.8) MIME] = fle). 


ie., the Mébius transformation is an involution. 

We must show that the power series in (2.3.6) is also convergent. This is 
established in 

Turorem 2.3.1. Let the radii of convergence of the power sertes in (2.3.1) and 
(2.3.6) be Ry and Ry respectively. Lf 0 © Ry <1, then Ry R30 ls Ry, 
then also l = Ry. 


"Loe, eit., Werke, vol, IV, p. oot 
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Proof. Choose Ry < R,. Then there exists an M such that |a,! < MR,” 
for every n. Hence by (2.1.3) 
Witteman” ee 


If R; S 1, then Ry < 1. We can assume M 2 1 without restricting the 
generality. Further, 


ad; + ad, +++: Sn, a +a+--: S p(n), 
where p(n) denotes the total number of prime factors of n. Hence by (2.1.5) 
(2.3.9) ib, | S 2(n)M?™ Ro". 


Since p(n) = O (log n) and x(n) = O(n’), we conclude that Ry S R.or R, < R2. 


Suppose next that Ri; > 1. We can then choose Ry > 1. Further, 


aid; + od, +--- 2 “Pi, + V2 Di. +:-:-= P(n) 
ifn = pi, pi, +: , Where the p,, are the distinct prime factors of n. Hence 
(2.3.10) b,, | < a(n) MR used 


But P(n) is infinitely often o(n). It follows that 


lim |b, |" < 1, 
n--o 
or Rk, 2 1. 
In order to complete the proof for the case Ry < 1, we use the involutory 
character of the transformation. We have shown that R; = R.. If Re < 1, 


we can conclude that R: S R, , i.e., Ri = Re, simply by noticing that f(z) is the 
MoObius transform of ¢(z). On the other hand, the assumption R, > 1 implies 
by the same argument that R; = 1, and this contradicts the original assumption. 
Hence R,; < 1 implies Ri = Re. 

If Ri = 1, we may well have Rk, > 1. The situation becomes clearer by 


introducing the associated Dirichlet series 


bo} o 
D(s;a,) = do a,n™, D(s;b,) = >, ban 
n=l n=1 
The assumption 2: > 1 implies that D(s; b,.) converges for all s and is an entire 
function of s. Hence D(s; a,) has a half-plane of absolute convergence and is 
not merely a formal Dirichlet series; moreover, a, = O(n") for some finite 
value of x. 
Thus if R; = 1, we have R2 = R; unless D(s; b,) is an entire function, and then 
R., = R,;. This case can arise only when a, = O(n’). 
If R; > 1, D(s; a,) is an entire function of s, and normally Rk. = 1, unless 
D(s; a,) ¥ 0, in which case R, = 1. 
After this discussion it is easy to discuss the validity of Mébius’ inversion 
formula. 
THEOREM 2.3.2. If R, is the radius of convergence of the power series for f(z), 
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the inversion formula (2.3.2) is valid for |z| < min (R,, 1). Jf Ry < 1, the 
series diverges for Ri < |z| <1. The series may converge outside of the unit 
circle, but normally it does not represent z for such values. If the radius of con- 
vergence of F(z) in (2.3.3) is R, formula (2.3.4) is valid for | z\| < min (R, R,, 1). 

Proof. Let 0 < R; S 1. Then all the terms in (2.3.2) are regular analytic 


functions of z in |z| < R,. By (2.3.9) 


>| bp f(z") | | <2 |b, PACD |-fe]™" 


a= 


(2.3.11) 


p(n)+1 | Z| 
<> x(n)M Ro” — 2 /*/Ra’ 
and this is clearly convergent for |z| < Ry. Here Ry < R; and as near to Ry 
as we please, i.e., the series in (2.3.2) is absolutely convergent for |z| < R,. 
Moreover, the double series obtained by substituting the power series for f(z”) 
on the right side of (2.3.2) is absolutely convergent, as we have just seen. It 
can consequently be rearranged at liberty. Collecting powers of equal degree 
and reducing with the aid of the algorithm of Mébius, the double series reduces 
to its first term z. This completes the proof for the case R, < 1. 
If R, > 1, the terms of the series (2.3.2) are holomorphic for | z | 
in no larger region. For such values formula (2.3.10) shows that the double 
series is dominated by 


< 1 and 


\ . p(n) +1 zry—P(n) 
(2.3.12) DX x(n)M"™ Ro i= Te’ 


It follows that the inversion formula is valid for |z| < 1. 


Let |z| <1. Then 


lim |b, f(z”) |""" = |z| lim |b, |" = |z|/Re. 


n—-2 


It follows that if Ri < 1, so that R, = R,, the series (2.3.2) diverges in the 
annulus R; < |z| <1. 

Outside of the unit circle the situation may differ considerably in different 
eases. Thus if 


f(z) = i =. a then z= , u(n) | 


a=l 


which clearly diverges for |z| > 1. But if 


z 
f(2) = 1_—2’ 
then 


— u(2k + 1),n = 2k+1, 
eS n = 2k, 
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so that the series 


converges also outside of the unit circle, but to —1/z instead of toz. Finally, if 
— 
f(z) = ze", 


where k is a positive integer, then the series 
we 
Zz b x ae 
n~ 
n=l 


converges on the rays |z| > 1, arg z = v2x/k, vy = 0, 1,---,k — 1, and 
nowhere else outside of the unit circle. The sum of the series tends to zero as 
z— « along the rays in question; thus, the sum of the series cannot be z, but 
I am unable to determine its actual value. 

The reader will have no difficulty in verifying the statements concerning 
F(z) in Theorem 2.3.2 on the basis of the estimates for the coefficients, and this 
part of the proof will be omitted. 


3. A class of linear functional equations. 

3.1. The Mobius A-transform. Let A = {a,} be a given proper, normalized 
sequence, 8 = {b,} the reciprocal sequence in the sense of §2.1. Mébius’ 
observed that the same algorithm enters in the study of the functional equation 


(3.1.1) ° G(z) = > a, F(z") 
n=1 


for which he proposed the solution 
(3.1.2) F(z) = 2) b.G(e"). 
n=l 


Conversely, (3.1.1) is a solution of (3.1.2) if F(z) is the given function. Though 
Mobius claims that these relations hold for arbitrary given functions, he has 
presumably only had power series in mind, and there is no indication that he 
looked into convergence questions at all. 


Putting 
Ge’) = g(z), Fe’) = f(z), 
we can rewrite the functional equations as follows: 
(3.1.3) q(z) = > a, f(nz), 
n=l 
(3.1.4) f(z) = Zz: b,.g(nz). 
nel 


? Loc. cit., Werke, vol. IV, p. 593. 
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These forms are more convenient to handle than the original ones, and will 
serve as the basis of the discussion in the present paragraph. Special cases have 
long been in the literature. Thus the case a, = 1, b, = u(n), z = k, gives 
the inversion formulas 


(3.1.5) g(k) = > f(nk), 
(3.1.6) f(k) = p> u(n)g(nk), 


which are used extensively in analytical number theory.” 

We proceed to an analytical discussion of equation (3.1.3). Let E be a set 
of points in the complex plane such that if 2 contains the point zp it also con- 
tains all multiples nz of z ,n = 2,3,---. Let f(z) be given in E and such that 


(3.1.7) Mf); MW = X a, f(nz) 

n=1 
converges in FE. We call this function the Mébius Y-transform of f(z) with 
similar notation and terminology for 8 and for other sequences. The reci- 
procity of the two sequences A and & is reflected in the property 


(3.1.8) M (MF); W; VB] = MIME); Bl; W = fe), 


valid for sufficiently restricted classes of functions f(z). 
THEOREM 3.1.1. A sufficient condition for the validity of (3.1.8) is the absolute 
convergence of the series 


(3.1.9) S[f] = > amb, f(mnz). 
m=1 1 


a 
Proof. The series, being absolutely convergent, can be rearranged arbitrarily. 
If summed by columns its sum is M{[M[f(z); W; Bl, if summed by rows, 
IM [MLf(z); B]; W, whereas summation over constant values of the product mn 
gives simply f(z), by virtue of Mébius’ algorithm. 
3.2. Inversion of the A-transform. We now turn to the question of finding 
the inverse of the %-transform, i.e., the resolution of the equation 


(3.2.1) ML f(z); W = gz) 
for f(z) in terms of g(z). 
THeoreM 3.2.1. A sufficient condition that 


(3.2.2) f(z) = Mig(z); BI 


‘See, e.g., P. Bachmann, Die analytische Zahlentheorie, Leipzig, 1894, p. 310 et seq. 
Bachmann does not seem to have been aware of Mébius’ paper. Thus he credits the 
introduction of the function u(n) to F. Mertens, Ueber einige asymptotische Gesetze der 
Zahlentheorie, Journal f. Math., vol. 77 (1874), pp. 289-338. 
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be a solution of (3.2.1) which is absolutely convergent in E is that the series S{g] 
be absolutely convergent in E. On the other hand, there can be at most one solution 
of (3.2.1) which renders the series S{f| absolutely convergent, and whenever it exists 
this solution is given by (3.2.2). 

Proof. The assumption that S[{g] is absolutely convergent implies that 
M[Mlg(z); Bl; 2 = g(z) by Theorem 3.1.1. Hence (3.2.2) gives a solution 
under these circumstances, and the solution is evidently absolutely convergent. 
Conversely, if f(z) is a solution of (3.2.1) such that S[f] is absolutely convergent, 
then for the same reason 


S(f] = f(z) = MMAF(z); AW; Bl = Mig); Bl, 


so the solution in question is uniquely determined and given by (3.2.2). 

It must be granted that Theorem 3.2.1 is of a rather restrictive character. 
It should be pointed out, however, that the mere existence of M{f(z); YW] is not 
enough to insure that this function is a solution of (3.2.1). Thus, for example, 
if there exists a non-vanishing function g(z) such that M[g(z); B] = 0, then 
formula (3.2.2) certainly does not give a solution of (3.2.1). See further §3.3. 

The following theorem is of a somewhat different character. 

THEOREM 3.2.2. Let g(z) be holomorphic in the sector S, 0, < arg z < 62, 
,z > R> O, and let 


(3.2.3) g(z) =z ‘lo a4 o(+)| asz— 2 inS, 


Further, suppose that D(s; a,) = > a,n™ is convergent and different from zero 
1 
for R(s) > Rla) — 6 € > O. Then (3.2.2) defines a solution of (3.2.1), holo- 


morphic in S, and 


(3.2.4) f(z) =z *|colDla a) a o( . )| asz—> x ims, 


|z| 
and this is the only solution of such asymptotic character. 


Proof. We have to show that M([Melg(z); Bl; A] = g(z) under the given 
assumptions. We start by observing that 


(3.2.5) Miz *; Al = Dla; a,)z *, 
(3.2.6) Mz *; Bl = Dla; b,)z-", 


the convergence of D(a; b,) being a consequence of Landau’s Theorem 2.2.1. 
Let us put 


(3.2.7) g(z) = ez “ + gi(z), 

(3.2.8) f(z) = @ D(a; b,)z * + filz). 
Then if f(z) is a solution of (3.2.1), fi(z) is a solution of 

(3.2.9) Mi filz); A) = gilz). 
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But to this equation we can apply Theorem 3.2.1. Indeed, by assumption 
gi(z) = O(| 2 |"), y = Ra), and the series D(s; a,,) and D(s; b,) are absolutely 
convergent for s = 1 + y, since they converge for s = y — e/2. Forming 
Sg] and replacing each term by its absolute value, we find that the series is 
dominated by a constant multiple of 


2 cs) 
awe _ | amb, | (mn)-*? 
m=1 n=1 


convergent in S. Hence fi(z) = Mfgi(z); B] is a solution of (3.2.9), and 
M[g(z); B] is a solution of (3.2.1). 


Further, 
( bes ) 
filz) = Os | 2} 7 7: | ba | nM ™, 
n=1 J 
or 
(3.2.10) lfi(z)| = M|z2{"”, 


whence it follows that the double series S[f;] is absolutely convergent in S. 
Hence, by Theorem 3.2.1, fi(z) is the only solution of (3.2.9) having this property; 
and, a fortiori, the only solution satisfying (3.2.10). It follows that M{g(z); B] 
is the only solution of (3.2.1) of the form (3.2.8), where f;(z) satisfies (3.2.10). 

It is obvious that the solution breaks down if D(a; a,) = 0. In general, it 
also breaks down if D(s; a,) = 0 for an ¢ with R(s) > Ra). It should be 
noted, however, that 


[D(a;a,)]) 2 * 
is a solution of 
MLf(z); A = 2° 


under the sole assumption that D(a; a,) is convergent and different from zero. 
This observation may sometimes be used in order to extend the validity of our 
solution. 

Thus, for example, Theorem 3.2.2 does not apply to the case a, = (—1)""' 
if 0 < @ < 5, but formula (3.2.2), nevertheless, gives a solution of the corre- 
sponding equation. This is readily seen by running over the proof again with 
this particular choice of the parameters. 

Further, it should be noted that the assumption on the remainder in (3.2.3) 
is chosen merely with the view of insuring that the Dirichlet series D(s; a,) 
and D(s; b,.) be absolutely convergent for s = 1 + y. If there should exist a 4, 
0 <6 < I, such that these series converge absolutely for s = 5 + y, it is suffi- 
cient for our purposes to assume that z“g(z) = ¢@ + O() 2) °). 


3.3. Additional remarks. We are dealing with two adjoint equations 
(3.3.1) Mlar(z), M] = a(z), 
(3.3.2) Mly(z), Bl = hie), 
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and the corresponding homogeneous equations 


(3.3.3) Mlu(z), A] = 0, 

(3.3.4) Mlv(z), Bl] = O. 
We have already observed that 

(3.3.5) Mz“, A] = D(a; a,)z ", 


provided R(a@) > o», the abscissa of convergence of the series for D(s; a,). 
In the same domain we have 

‘ Qo" i 9 a" - 
(3.3.6) mM [ae ~[D(a; a,)z “]. 


da" ~ Aa 
From this we conclude that if the equation 
D(s; a,) = 0 
has a k-fold zero at s = a@ with R(a) > oo, then 
(3.3.7) z“,z2 “logz,--:,z ” (logz)*” 


are solutions of the homogeneous equation (3.3.3). 
It is clear that, if there are infinitely many zeros of D(s; a,) in the domain 
of convergence of the Dirichlet series, then any function of the form 


(3.3.8) > ene 
n=l 


satisfies (3.3.3), provided the double series 


D> an >. ¢n(mz)~*" 
m=1 n=l 
can be rearranged so as to interchange the order of the summations. 

It is perhaps worth while remarking that the series (3.3.8) do not form a 
dense set in any of the function spaces usually considered, such as C[1, ©] or 
L,(1, ~), 1 S p < «. Indeed, the set {z “"} will be closed in the space in 
question only if the series 


a + bR(a,) 
X 1 + | On [? 


diverges, where a and b are constants depending upon the space. But in our 
case R(a,) is bounded; hence we are demanding the divergence of the series 

‘a, |. But according to Landau’ the number of zeros of an ordinary 
Dirichlet series in the domain ¢ 2 oo + ¢, |t| < T, is O(T log T) and this 
frequency clearly does not permit the divergence of lan|~ 


*F. Landau, Uber die Nullstellen der Dirichletschen Reihen, Berliner Sitzungsberichte, 
vol. 14 (1913), pp. 897-907. 
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Our next remark concerns the existence of a solution of (3.3.1) when g(z) 
satisfies (3.3.4). It is clear that the inversion formula of Mébius cannot give 
a solution in this case. It seems very plausible that no solution can exist under 
these circumstances. In certain simple cases it is possible to verify this surmise. 
Take, for example, 


xe 


> f(a's) = 1. 


k=0 


Here D(s; b,.) = 1 — 2° and M{[1, B] = 0. Consider any domain E of the 


type described in §3.1, i.e., if z « Z, so do nz for n = 2,3,---. If the equation 
holds for z = z and for z = 2z , then we get by subtraction f(z) = 0. If it is 
true for z = 2*z for every integer k, we get by the same argument that 


f(2‘z.) = 0 for every k. But this clearly contradicts the assumption that the 
equation holds for z = z. Thus the equation in question cannot have any 
solution in E or even in a point set S which contains 2z) whenever it contains 2p . 

The final remark of this paragraph concerns the solution of (3.3.1) when g(z) 
is a solution of (3.3.3). It is enough to consider the case g(z) = z *, where 
s = ais a k-fold root of the equation D(s; a,) = 0 in the half-plane of con- 
vergence of the Dirichlet series. Equation (3.3.6) then shows that 


a(e) = 2 *{(—1)[D™ (a; a,)I (log 2)* + & 6, (log 2) 


is a solution of (3.3.1). This result shows a further analogy between the formal 
theory of the equations here considered and that of linear differential equations. 

3.4. Further equations with the same algorithm. It was known to Mébius” 
that his algorithm entered in the study of other functional equations. The 
following example is slightly more general than the situation which Mébius had 
in mind. 

We consider two multiplicative systems, i.e., we give two sequences {«(p)} 
and {w(p)}, where p runs through the primes, we take e(1) = w(1) = 1 and 
define «(n) and w(n) by the equations 


(3.4.1) e(mn) = e(m)e(n), w(mn) = w(m)w(n). 
Let us form the functional equation 


(3.4.2) > a,e(n) f(w(n)z) = g(z). 


It is not difficult to see that a formal solution is given by 


(3.4.3) f(2) = E dae(nda(w(n), 


'0 Mobius, loc. cit., Werke, vol. IV, p. 594. 
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where |b, as usual is the reciprocal sequence of {a,}. The elementary methods 
of $3.2 can be used to develop sufficient conditions for the validity of this 
inversion formula.’ The details can be left to the reader. 


4. General algorithms. 
4.1. The Mobius A(u)-transform. We can write the Mobius %-transform as a 
Stieltjes integral, viz., 


M[fl(z), A) = [ feydacw, 


where 


A(u) = > ™ Re 
n<u 
This suggests a generalization of the inversion problem to arbitrary functions 
A(u) of bounded variation. 
Let A(u) be given for u 2 1, A(1) = 0, of bounded variation in every finite 
interval, and such that 


(4.1.1) A(u) = O(u*™) 


for every « > 0, where w 2 0 is a constant. 

Suppose now that f(z) is an analytic function satisfying the following condi- 
tions: (i) f(z) is holomorphic in a sectorial domain S such that if z is in S, so 
is uz for every u = 1; (ii) the integral 


(4.1.2) mise-ACa] = [ f(uz) dA(u) 
1 


exists in some domain S, C S. 

We shall as a rule use the abbreviated notation IN{f, A] and refer to this 
function as the Mébius A(u)-transform of f(z). To every fixed function A (u) 
satisfying the above conditions there is a class § [A] of functions f(z) which admit 
A(u)-transforms in the sense of the definition. 

The effective determination of §[A] may be quite laborious except in the 
simplest cases, but it is easy to find a subclass of §[A]. Let f(z) be holo- 
morphic in a sectorial domain S and 


(4.1.3) f'lz)|< Mie|™, Y>w. 


It is easy to see that Wf, A] exists in S; thus every such function belongs 
to ®A |. 

4.2. The reciprocation problem for Laplace integrals. In the case of the Y- 
transform the inverse or reciprocal transform was given a priori and had a 
sense for a sufficiently restricted but not vacuous class of functions. For the 


! Some instances of this inversion formula figured in the papers of I. Hille and O. Sziisz, 
On the completeness of Lambert functions, of which the first part appeared in the Bulletin of 
the American Mathematical Society, vol. 42 (1936), pp. 411-418, and the second in the 
Annals of Mathematics, vol. 37 (1936), pp. 800-815. 
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A(u)-transform the situation is different, and the inverse transform need not 
exist at all. By analogy with the sequence case we should consider the Laplace- 
Stieltjes integral 


(4.2.1) D(s) = . u “dA(u). 


The hypothesis (4.1.1) insures the convergence of this integral for ¢ > w. 
We should then take the reciprocal of D(s) and find its representation as a 
Laplace-Stieltjes integral. But it is well known that [D(s)]™ ordinarily is not 
representable in this manner. 

Necessary and sufficient conditions in order that [D(s)}"' shall be repre- 
sentable by a convergent Laplace-Stieltjes integral do not seem to be known. 
In the following I shall give two sets of two conditions each. One condition is 
common to the two sets; the first set is necessary, but perhaps not sufficient, 
the second set is sufficient, but certainly not necessary.” 

THEOREM 4.2.1. Let D(s) be a function representable by a convergent Laplace- 
Stieltjes integral. In order that [D(s)|“ shall also admit such a representation, 
it is necessary that (i) lim D(c) # 0, and (ii) there exist a half-plane « > a 

ox 
in which D(s) ¥ 0. 


Proof. That the conditions are necessary is obvious. Since 


lim D(o) = lim A(u), 


o> +o u—il+ 


we can replace condition (i) by the equivalent condition (i’) A(u) is discon- 
tinuous at u = 1. 

THEOREM 4.2.2. If A(u) is discontinuous at u = 1, A(1 + 0) = a # 0, 
and if the Laplace-Stieltjes integral representing D(s) is absolutely convergent for 
« > a, then [D(s)|"' is representable by a Laplace-Sticltjes integral absolutely 
convergent for ¢ > max (a, 8), where B is the root of the equation 


(4.2.3) la|= [ u “dVi[A(e) — al,” 
1 


if it exists; otherwise B = — x. 
Proof. Let us put 
Av) = AW) — a, Ax(1) = 0, 
ViYAi(v) = Ao(u), 


A,(u, s) = [ v “dAj(v), 
1 


Ao(u, s) = [ v “dAol(r). 
1 


" It is to be hoped that the investigations by R. H. Cameron and N. Wiener, now in 
progress, will throw further light on this question. 
8 Here and in the following, V°f(t) denotes the total variation of f(t) ina St S 6 
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Then Ay(u), Ai(u), Ao(u, s), and A,(u, s) are continuous at u = 1 and tend to 
zero as u — 1. A simple consideration shows that 


(4.2.4) Vi Air, s) < Aj(u, c). 

By assumption 

A,(u) = O(u*™*) 

for every « > 0. It follows that for ¢ > a@ the increasing function Ao(u, ) 
tends to the finite limit Ay(*, o«) as u — «. Further, it is obvious that 
Ao(~, ¢) is monotone decreasing when ¢ increases and tends to zero aso > &. 
The latter conclusion follows from the fact that Ao(w) — 0 monotonically from 
above as u— 0+. Hence the equation 
(4.2.5) A(x, o) = |a| 
has at most one root = a. We define 8 to be equal to this root if it exists; 
otherwise we take B = — =~. 

Now let ¢ > max (a, 8). Then 


D(s) = a + Ai(~, 8), 


and 

|Ai(x,s)| S A(x, o) < jal. 
Hence 
(4.2.6) [D(s)}J' = dD (-1)"a""[Ai(, 8)]", 


and the series is absolutely convergent. We shall rewrite this series as a 
Laplace-Stieltjes integral. We have for ¢ > y > max (a, 8) 





(4.2.7) A,(x, 8s) = [ u ?d, Ay(u, y). 

Hence 

(4.2.8) [A,(«,s)]" = [ u" ’d,A,(u, y), 

where 

(4.2.9) A,(u, 7) = [ A,1(u/v, y) d, Ax(v, y) (n = 2,3, +--+). 


Here (4.2.8) is absolutely convergent, being the product of absolutely con- 
vergent. Laplace-Stieltjes integrals.‘ This fact also follows from the subse- 






quent estimates of A,(u, 7). 






For the properties of Laplace-Stieltjes integrals used in this paper, consult D. V. Wid- 
der, Trans. Amer. Math. Soc., vol. 31 (1929), pp. 694-743, and Kf. Hille and J. D. Tamarkin, 
Proce, Nat. Aead. Sei., vol. 19 (1933), pp. 573-577, 902-912; vol. 20 (1934), pp. 140-144. 
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We shall prove the inequality 
(4.2.10) | A,(u, vy) | S ViA,(v, y) S [Ao(u, y)]". 


The inequality is obviously true for n = 1. Suppose that it has been proved 
for n = k. It follows in particular that A,(u, y) is continuous at u = 1 and 
tends to zero as u— 1. Using (4.2.9) with n = k + 1, we see that Agui(u, y) 
has the same property, whence it follows that it is sufficient to prove the second 


half of the inequality. But 
u v | 
[ af A,(v/t, Y) d, Ax(t, 7) 
| 


< [ d, [ | A,(v/t, Y) |-| d, Ait, Y) | 


s [ d, [ [Ao(v/t, y)]* de Ao(t, v) 
1 1 


Vy Agyilv, y) 


~ [ [Ao(u/t, y)]* ds Ao(t, y) 


= [Ao(u, YI" [asst Y) 


= [Ao(u, y)]*"". 


This completes the proof of the inequality. 
Let us put 


a 1 = cal a ~a§ | k 
(4.2.11) |Blu, y =a + X ( 1)"a""" A,(u, ), u>1, 
Then by (4.2.10) 
(4.2.12) | Blu, y)| S Vi Be, y) Ss Le lal"fAo(w, v1", 


the series being absolutely convergent and uniformly bounded for 1 < u < ©. 
Hence B(u, y) is of bounded variation in [1, 2%] and 


(4.2.13) [D(s)|' = / u “"” d, Blu, y) = | u“dB(u), 
1 1 
where 
(4.2.14) B(u) = | v’ d, Biv, y). 
Ji 


The second integral in (4.2.13) is clearly absolutely convergent for ¢ > max 
(a, 8). This completes the proof of the theorem. 
The assumption that D(s) has a half-plane of absolute convergence is obv iously 
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unnecessarily restrictive. But simple convergence is not enough to insure the 
existence of even a formal Laplace integral for the reciprocal, much less of a 
half-plane of convergence. 

4.3. Inversion of the A(u)-transform. In the present paragraph we shall 
assume that the reciprocal of D(s) admits a representation by means of a 
convergent Laplace-Stieltjes integral. The functions A(u) and B(u) are then 
joined by the relation 


(4.3.1) [ B(u/v) dA(v) = [ A(u/v) dB(v) = 1, u>l, 
1 1 


for almost all values of u. This is the transcendental analogue of the Mébius 
algorithm to which it reduces when A(u) is a step function with jumps at the 
integers. 

We can now expect that for a sufficiently restricted class of functions f(z) we 
shall have 
(4.3.2) MIMS, A], B} = MIMS, Bl, A} = fz). 


We have the following analogue of Theorem 3.2.1. 
THEOREM 4.3.1. A sufficient condition that (4.3.2) shall hold is that the double 
integral 


(4.3.3) I{f|= I [ f(uvz) dA (u) dB(v) 


be absolutely convergent. 
Proof. Let 


V¥A(t) = Ao(u), ViB(t) = By(u). 


The condition of the theorem is then that the integral 


i i f(uwz) | dAg(u) dBy(v) 
i 1 


be convergent. It is then permissible to regard (4.3.3) as a repeated Stieltjes 
integral and the order in which the integrations are performed is immaterial. 


Now 
[OE LF sea) aa) 
‘uvz) dA(u)? dB(v), 
1 Wh s 


ae a 
MIMS, Bl, Ay = | ‘ff f(uvz) dB) dA(u). 
1 (hh J 


MIM, Al, By 


Hence these two operations exist and give the same result. On the other hand, 
going back to the definition of the Stieltjes integral as a double sum and “sum- 
ming by hyperbolas” wo = const. before passing to the limit, we can show 
that the double integral can be written in the form 


[F sove) ae f B(w/u) dA(u). 
1 1 


, 
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Formula (4.3.1) shows that this expression reduces to f(z). This completes the 
proof of the theorem. 

We can now pass to the question of inversion. 

THEOREM 4.3.2. Let B(u) exist as a function of bounded variation and satisfy 
(4.3.1). A sufficient condition that 


(4.3.4) f(z) = Mig, BI 
be a solution of 
(4.3.5) g(z) = Mf, A] 


in the domain S is that the double integral I\g| be absolutely convergent in S. On 

the other hand, there cannot be more than one solution f(z) of (4.3.5) which renders 

I{f| absolutely convergent, and whenever it exists this solution is given by (4.3.4). 
Proof. The assumption that J|g] is absolutely convergent implies that 


MMe, B], A} = g(z) 


by Theorem 4.3.1. Hence (4.3.4) gives a solution of (4.3.5) and the integral 
is obviously absolutely convergent. 

Conversely, if f(z) is a solution of (4.3.5) such that J[f] is absolutely con- 
vergent, then for the same reason 


f(z) = I[f] = MIMS, A], B} = Mg, Bo, 


so that the solution in question is uniquely determined and given by (4.3.4). 

Again it is necessary to remark that the mere existence of Yt{g, B] in a domain 
S is not sufficient to insure that this function be a solution of (4.3.5). Indeed, 
suppose that the Laplace-Stieltjes integral 


[D(s)}"' = [ u* dB(u) 
1 


has a zero s = @ in the half-plane of convergence. Then 
x 
Mz *, B) = z* u “dBlu) = 0 
1 
and is certainly not a solution of the equation 


ML f(z), A(u)] = 2. 


4.4. Concluding remarks. In the previous discussion | have perhaps over- 
emphasized the rdle of the associated Laplace-Stieltjes integral 


D(s) - | u-* dA(u). 
1 


It should be observed that this function and its reciprocal are mainly tools in 
the discussion, and the decisive réle is really played by the reciprocal functions 
A(u) and B(u) which are supposed to satisfy the algorithm (4.3.1). We know 
from the sequence case that these functions may very well exist without the 
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associated Laplace-Stieltjes integrals having any domain of convergence or even 
any a priori obvious significance. The theorems of §4.3 really presuppose 
merely the existence of a pair of functions satisfying (4.3.1) and not the existence 
of the associated Laplace-Stieltjes integrals. 

But it must be admitted that when the integrals do not exist, the problem of 
finding the function B(u) reciprocal to a given function A(u) is in general not a 
very promising one. Sometimes we may circumvent the difficulties by a pre- 
liminary application of a suitable method of summation. Thus it may happen 
that s "[D(s)]" is representable by a Laplace-Stieltjes integral even though 
[D(s)]" is not. In this case the corresponding function B,(u) can be used to 
find an n-fold integral of the formal solution of (4.3.5) from which the solution 
itself may be found by solving an integral equation of the Abel type. 

There are of course other functional equations which may be treated by the 
methods of this paper; for instance, the equation 


r F(z — u) da(u) = G(z), 


which is associated with Laplace-Stieltjes integrals having 0 instead of 1 as the 
lower limit of integration. 

Finally, it should be remarked that there are various relations, some obvious, 
others less so, between the functional equations treated in this paper on one 
hand, and the theory of Watson transforms and the Karamata-Wiener Tauberian 
theory on the other. The author hopes to return to these questions at a later 
opportunity. 


YALE UNIVERSITY. 











ON BERNOULLI’S NUMBERS AND FERMAT’S LAST THEOREM 
By H. S. VANDIVER 


1. Introduction. In the present article a report will be given on the work 
which has been carried out under a grant made to the writer from the Penrose 
Fund of the American Philosophical Society. 

One of the objects of the work was to extend the known tables of Bernoulli 
numbers expressed as rational fractions in their lowest terms and to check all 
previous tables. This was carried out by D. H. Lehmer,'’ who tabulated B, 
for n = 91 to 110, inclusive. The previous tables had given the values 
B, (n = 1, 2, --- , 92). The first ninety of these were reproduced in the tables 
of H. T. Davis.” Here 


By = (—1)" "bea 
and 
(b+ 1)" =b, (n # 1), 


where the expression on the left means that (6 + 1) is taken to the n-th power 
by means of the binomial theorem, and by is substituted for b* (k = 1,2, --- , n). 
All the above mentioned B’s were employed by Lehmer in checking the regu- 
larity of primes, a prime p being defined as regular if it does not divide the 
numerators of any of the first (p — 3)/2 B’s. The details of these latter com- 
putations will be treated below. 

Another object of the work under the grant was to extend the results of the 
writer on Fermat’s Last Theorem for special exponents. 

In other papers’ it was established that 


(I) a+y'+2' =0 


is impossible for x, y, and z non-zero rational integers and / a given integer 
2 <1 < 307. In the present article we shall describe the work which estab- 
lished this result for 306 < 1 < 617, excepting | = 587 which exponent has not 
yet been tested (ef. note, p. 584). As the methods employed for these large ex- 
ponents are quite elaborate and complicated, we shall explain many details. 


Received May 24, 1937. 

! This Journal, vol. 2 (1936), pp. 460-464. This article includes references to previous 
tables. Lehmer computed the value of Bis in addition to those mentioned but not as 
part of the present project. Cf. Annals of Math., vol. 36 (1935), p. 648 

2 Tables of Higher Mathematical Functions, vol. 2, Bloomington, Indiana, 1935, pp 
230 233. 

* Proc. Natl. Acad. Sei., vol. 17 (1931), pp. 661-673 (referred to later as N. A.) with 
references there given. 


oo 
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We have persisted in the examination of special exponents in (I) in the hope 
that, if one of the criteria that we have employed throughout (N. A., p. 670, 
Theorem IV) for irregular primes | breaks down for a particular 1, we shall find 
such an / in the range of our computations. So far no exponent to which we 
have applied the criterion has belonged to this class. 


2. Congruence properties of the Bernoulli numbers. In a previous paper a 
number of congruences of this character were derived,’ and we shall here gen- 
eralize and simplify some of these results and derive others to be employed later. 
If p is prime, consider the identity 


_ 1 
(1) (x?* 4 1) ct ee (2”™ Ps 1) x 
z—] r—1 
and write 
<? (7) =T2 + yal 1? + Sena ao (ps = | lie ai 
fora > l and 
f(z) = ] + zx + xr 4. ee 7 grt 
These functions will be called Mirimanoff polynomials. If e is the Napierian 
base, we see that 


] (s) _” 
(2) | Jn = ?] = fiii(a). 
v veal) 
In lieu of this we may employ the formal operation 
x df," (x) 
Raby 


but we shall use the exponential function as most of the papers which have been 
written along these lines employ it. 

Setting « = e'z in (1), differentiating a times, and setting » = 0, we have, 
employing Leibnitz’ theorem, 


(x — 1)fS\(2) + apke™f\” (x) 


(3) é mayne i A 
= (2"" — I)fayi(2) + apma?”"f,” (x) (mod p’). 
Let p be an m-th root of unity # 1; then the last relation gives, if m 4 0 (mod p), 

m—1 m1 m1 
8 (p”™* — If." i(p) + ap os kp” f."(p) = apm > f*'(p) (mod p’). 

k=l kel kewl 

Now 

fr" (pe) = V+ wr +--+ 0”) (pe) = 0 (mod p), 


* Proc. Natl. Acad. Sei., vol. 16 (1930), pp. 139-150. 
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whence 
(4) — DO’ mf2i(o) = apm , DY 6°) (mod p’), 


ae . . . . 
where > indicates summation over all the distinct values of p. 
Consider the summation 


DY feri(e). 


If we carry the summation over each term, then the term of the form pa fp = 
—t* for t not divisible by m, whereas if t is divisible by m the term is (m — 1) t°. 
Noting that if a # 1 (mod p — 1), 
pm—tl m- p—l 
=> > (j + ph)* = mS, + 2. aph Sa. (mod p); Sa = i ., 


i=l =—0 j=0 A i=l 


we obtain, using S,-; = 0 (mod p), the relation 


(5) Do fati(e) = (m**" — m)S, (mod p’). 
Hence (4), (5), and the known relation S, = pb, (mod p’) give for a even, p > 3, 


a+l 
(6) am “ 2 b, = } : fie) (mod p), 
whence 
” 1 — m* — ee! 
(7) re ba , 2d J (mod p), 
am — —— 


where [2] is the greatest integer in x. 


We have 
(n—k)p| _ _ 4 — |] a 
| n ] a |» n| ? n I, 
whence 
. 2a—1 ‘ 2a—1 
(8) ({ = = —i+ ') = -({*] + ‘ (mod p). 


From this, we may show that, if k < n, 


yin) c= —- 


k+1 4 a-k = 0 (mod Pp); 
(9) Uigie 


(m) -2n—1 
C; ~ = t . 
i=[(1—1) p/mi+t 


¢ + ne | ia H it 
n ie 


( 
In Cy?) there are 
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(n) 


terms, and in C’,”), there are 


k = be * k —k- ve | 
7 Rie eet 


terms, and these two numbers are equal since 


(n—k)p]} _ _kp}_. _ | kp) _ 
eee 


with a similar relation in which k + 1 replaces k. Now (8) gives 


> (=| —i+ 1) -> ({?| m y 


This is the relation (9). Now consider 


(n — Nc;” + (I — 1)Ct4,-2. 
Using (9) we see that this is congruent modulo p to 
(n — 21+ 1)C{”. 
Also for n odd we have 
Cirion = 0 
from (9). Hence from (9) we have, using (7), 


1 oe n™* [n/2] 


= 2D (n—- +100)” 


** Qan%e rs 
(10) Pie ot 
: (m= — pew es: > > co. 
=a l=1 pl 
For an odd n = n, this reduces to 
eo [n,/2] a 
(10a) boa 1 ny = YD (n + 1 21) cr 


a—l 
4an\" i=1 2 


To accord with the notation employed in other papers we write 


(6) a (4) 
Cr = Ci; Ci =A: 
defined for p > 6. 
Now for 2a < (p — 1) we have 
2 »p | 2 
l1—n™ né -n" 
Ma « ’a—1 2a ‘ ’a—! 

2an™ 2an™ 

n” n 
boa 


(mod p). 


(mod p) 


(mod p) 


(mod p). 


(mod p). 











p). 
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Hence we obtain from (10) and (10a) 


p-20 
(11) ae ban = 5C, + 3C + Cr 
(12) (AP — Dom ws 841 + As 
2a 
p-2a , 
(13) Gr — he 0,40, 
4a 
observing that 
(p/3) 
C; + C2 = e 
s=1 


We note also that 
A, + A. =C,4+C€.4+ C3 
and (11) gives 


p—2a — b 
Oe Cs + CD + 20 + An + As 
p—2a as p—2a ed 
w= — 4 * 440. + 2(C; — AD 
2a 2a 


This gives 


2a 


— 37" — 4” 4 6” 


. ep | 
Ay _ C; a (—1) B, 4a 


(2”-* — 1)(3”"" — 2”"" — 1) 


—1)° 2 
(—1)°E - 


Ill 


and this may be put in the form 
[p/4] 


ay 1S tec DOM 


s=[p/6)}+1 da 


for p > 7, 2a < p— 1. 
From (10) we have 


(2° * — Db 
(15) ~ &A,+ Az 
which with (13) gives 
. Dea 2(A, + Az) — 2C; 


\a/2) gi ttt gi-te 4 


(—1)""'B, 
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(mod p); 
(mod p); 


(mod p), 


(mod p). 


(mod p), 


(mod p) 


(mod p) 


(mod p) 


(mod p). 
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Now set 
D, = c?", 
then (10a) gives 
(17) bog . 5" = 2D, + D, (mod p). 
4a-5°*! 


Now (11) and (15) give 


(6”-* — 6)bea (2?-* — 2)bea 


= 4 : ” 
5a C, + 2C. + 5a (mod p) 
or 
} af pie 9? 2a 
Deal 6 F 20, + C2 (mod p). 


4a 
Subtraction of this from (17) gives, modulo p, 


~p-2a = p--2a p—2a = 
(18) 2D, — Cy) + De — Co = ba = G — : :. 


3. Examination of primes as to regularity. The first step in our examination 
of a particular 1 in (1) was to determine if it was regular, so that we could 
apply the known result of Kummer to the effect that (1) is impossible if lis a 
regular prime. For this purpose formulas (14) and (16) were mainly employed 
heretofore. For the larger primes, however, (18) was found to be more valuable 
and was mainly employed in the present work, where 1 = p. The number of 
values of s used in (14) is approximately [1 — 1/12]. The number required in 
(18) is larger, but there exists a relation between the values of s in (18) which 
does not hold in (14). The range of values for s in the expression 2(0D,; — C)) 
is from {1/6} + 1 to [1/5], where each value is used twice. The expression 
D, — Cy contains the values from [1/5] + 1 to [21/5] less the values from [1/6] + 1 
to [1/3|. Thus the negative values in the second expression cancel the double 
values of the first, leaving the values of s in (18) in two ranges, [1/6] + 1 to 
(1/5) and [1/3] + 1 to [20/5]. 

Let n be a number in the range [1/6] 4+ 1 to [1/5]. Then n = [1/6] + 4, 
where 7 is a positive integer. Let g (1/6). Then 1/6 j + k/6, where 
Q0<-—k — 6, and l/3 23 + k/3. Therefore, 2{l/6) 2 |l/3] 1, and 2n 2 
(3) + 1. Also n <= [l/5], and as above 21/5) < [2l/5|. Therefore, 2n s 


(21/5). ‘Thus we have the relation: 
9) If n te a number in the range \l/6\ 4+ 1 to \L/5|, then 2n is in the range 
(1/3) 4+ 1 to (20/5). 


Jt also follows from the above argument that if m is an integer such that 2n 
is in the range (1/3) 4 1 to [20/5], then n is in the range [l/6) 4+ 1 to [l/5|. Thus 
for a particular value of a in (18) it is sufficient to compute the powers of the 
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odd values of s in the range [1/3] + 1 to [21/5]. The sums of the powers of the 
even values may be found in one operation by computing 


(20) 9*4 1 Ps . 


M. M. Abernathy has obtained a result concerning primes of the form 4n + 1. 
If index s = 2k, then 


ind s?"? = 2n.2k = 4nk 


0 (mod / — 1). 


ry i—1)/2 i—1) /2+ . 
Cherefore s° = 1 (mod J) and s' * = s' (mod 1), where ¢ and & are 
integers and | is a given prime of the form 4n + 1. If index s = 2k + 1, then 


ind sg“? = 2n(2k + 1) = 4nk + 2n = 2n (mod | — 1). 


Therefore s‘“* = —1 (mod J) and s“?**! = —s' (mod 1). Thus in testing 
for regularity of primes of the form 4n + 1, powers of s greater than (J — 1)/2 
do not have to be computed. Division of values of s into two groups, those 
of even and those of odd index, permits computation of 


by taking 
(21) My —(-D/2 _ > & 1-1 . 
*1 #3 
where s; ranges over values with even index, s: over those of odd index. 

In computing the right-hand member of (18) it sometimes happened that 
this was = 0 (mod 1) because the second factor on the right had its numerator 
divisible by 1 = p. In these cases (14), (15), and (16) were employed until the 
numerator of the factor not involving be, was found to be # 0 (mod /). When 
a B, was found with its numerator divisible by J, then this was checked by 
employing the tables of Bernoulli numbers of Lehmer or of previous writers, 
provided the Bernoulli number fell within the range of their computations. 
Further, to aid in this part of the work, Lehmer suggested and earried out the 
division of each B, (a S 110) by each of the primes 1, where 547 S /] < 601, 
and for all primes /; of the form 4n + 3, where 601 < 1, < 619. Jacobi's table 
of indices” was employed to carry out the other part of the computations. We 
shall now give an example of the method for 1 = p = 541. For this ease, the 
terms in the right-hand member of (18) are 


108 
AD -— C) =2D s*", 
s=91 
216 
(De (.) = 3 . 
sets 


’Canon Arithmeticus, Berlin, 1839, 
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For each s appearing in the above ranges, s*” ' was computed employing Jacobi’s 
tables for a = 1, 2,--- ,(l — 3)/2; then the computations of D; — C; and 
D, — C2 were each separated into two parts since information about some of 
the terms s““’ in the second sum can be obtained immediately from some of 
those of the first, as we indicated at the beginning of §3. For the primes 
mentioned above tested by Lehmer we started with a = 111 in lieu of a = 1. 
In the computation of s“’ (a = 1, 2,--- ,(l — 3)/2), periodicities may 
appear, which, since we are using indices, may be determined in advance. If 


1 — 1 = km and index s = kn, then s will begin repeating at s”*'; for s™? = 
s (mod 1) follows from the fact that since r“" = s, 
kn(m + 1) = kmn + kn = kn (mod J — 1). 


At any stage of the work we have another check by the possible use of the 
formula 


and in particular the convenient relation 


(t—1)/2 


> s*'=0 (mod J). 

a=l 
The regular’ primes l (306 <1 < 619) are as follows: 313, 317, 331, 337, 349, 
359, 367, 373, 383, 389, 397, 419, 431, 439, 443, 449, 457, 479, 487, 499, 503, 


509, 521, 563, 569, 571, 599, 601, 613, and the irregular primes are 307, 311, 
347, 353, 379, 401, 409, 421, 433, 461, 463, 467, 491, 523, 541, 547, 557, 577, 
587, 593, 607, 617. 

As to the time required for the work on a particular prime I, close to the 
value 600, in a test for regularity, a person experienced in this work requires 
on the average about forty hours to complete the test provided the prime is of 
the form 4n + 3. A prime of the form 4n + 1 requires about two thirds of 
this time. 

Concerning the numbers in the set 


(22) B,, Be, --- ’ Bu 3) /2 
which are divisible by l for a particular | we found 
B, = 0 (mod L) 


in the following cases: | = 307, n = 44;1 = 311," = 146;1 = 347, = 140; 
| = 353, n = 93, n 150;1 = 379, nn = 50,n = 87;1 = 401, n = 191; 1 = 409, 
n = 63;1 = 421, n 120;1 = 433, 183;1 = 461, = 98; 1l = 463, n = 65; 
lL = 467, n = 47,n = 97;1 = 491, n = 146, n = 169; 1 = 523, n = 123, n = 200; 


l= 54l,n 3,1 = 547, 0 135, n = 243; 1 = 557, 0 lll; 2 = 577, 
n = 26;1 = 587, n 45,n = 46;1 = 593, n 11;l = 607, n 206; l 617, 
n 10, S7, 1 169. From the above it will be noted that just two 


of the B’s in the set (22) are divisible by lin each of the cases | = 353, 379, 


© The irregular primes < 21) are listed by the writer in the Transactions of the American 
Mathematical Society, vol. 31, pp. 615, 615 616, and for 211 5 1 < 3O7in N. A, p. 667 
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467, 491, 523, 547, 587. For 617, three of the B’s in (22) are divisible by 617 
and this is the first 1 of this sort encountered in our work since the beginning. 


4. Treatment of the exponents in (I) which are irregular primes. Here we 
employ the following: 


THEOREM 1. Under the assumptions: none of the units E,(a = a), G2, --- , As) 
is congruent to the l-th power of an integer in k(¢) modulo ¥%, where ¥ is a prime 
ideal divisor of p; p is a prime < (l' — 1) of the form 1 + lk; and a, , a2, «++ , a 
are the subscripts of the B’s in the set 


B,, Br, --- , Bu-sp 


whose numerators are divisible by 1; the relation (1) is impossible in non-zero ra- 
tional integers x, y, and z. 
In the above statement, 


i= (“" =i ~t 2); 
ag-na-s)/’ 
r being a primitive root of 1; ¢ = e""'". As indicated in N. A., p. 667, to test 


this for a particular irregular ] and a p < (l — 1) of the form 1 + kl, we com- 
pute the value of E,(d) modulo p, where E,(d) is written in the form 


ae “TL. (& - ') padi 
i=0 av’ — | , 


—— O44" 4 iat 4 ph Shy 


h =1— 2n. 


d is a rational integer such that 


d' = 1 (mod p). 
In N. A., p. 668 a systematic method of carrying out such a computation 
was explained for the case 1 = 271, p = 1627. M. E. Tittle has further ab- 


breviated this scheme in general. Since 


d' = 1 (mod p), 
1) are obtained immediately from a table of the first powers 
of d, modulo p. This renders unnecessary the third row employed in N. A., 
p. 668 and eliminates the computation involved in obtaining the fourth row. 
A companion table to the powers of d, with powers of p, a primitive root of p, 
substituted and with blank spaces for the omitted numbers permits calcula- 
tion of ind (d" 1). Wd 


all values of (d"° - 


1) is not one of the numbers appearing in the 


™N.A., p. 670, Theorem IV; Bulletin of the American Mathematical Society, vol. 40 
(1934), p. 124, paragraph immediately following statement of Theorem 2 
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companion table, multiply (d"" — 1) by p, and examine the table for the appear- 
ance of this number. Repetition of this operation will yield a result in at 
most k — 1 steps, for the numbers n, pn, pn, ---,p" 'n, have as indices the 
numbers m, m + 1, m + 2,---,m + k — 1, where index n = m, and one of 
the numbers in this last set is of the form ki, 7 an integer, and all indices of the 
form ki are presented. To obtain the actual value of (d" — 1), subtract from 
the observed value in the table the power of p required in the multiplication 
p'(d" — 1). Detailed checks on this type of work are described in N. A., 
p. 669. 

The following table gives the specific results of the computation for each 
irregular prime. In the table, | is the prime exponent appearing in (I), n is a 
Bernoulli number in the set (22) whose numerator is divisible by I, r is a primitive 
root of l, d is the integer selected such that 


d' =} (mod p), 


p is a prime of the form 1 + kl referred to in the statement of Theorem 1, pis a 
primitive root of p, ind is the index, modulo 1, found for E,(d), modulo p. 
When two values of n are listed for a particular 1, then the corresponding 
indices are listed in the same order in the last column. Thus for 1 = 353, 
By; = 0 (mod 353), ind Ey;(2804) = 57 (mod 353); Biso = 0 (mod 353), ind 
Ey9(2804) = 13 (mod 353). The exponents | = 587 and l = 617 have not yet 
been tested by the criteria of Theorem 1. 


l n r d iD p ind 
307 44 5 168 1229 10 213 
311 146 308 135 1867 1857 27 
347 140 337 64 2083 2 333 
353 93, 150 3 2804 4943 10 57, 13 
379 50, $7 2 3954 4549 6 101, 200 
401] 19] 211 3° 3209 3 365 
409 63 235 2 1637 2 115 
421 120 238 6'° 4211 6 338 
433 183 10 2 1733 2 332 
461 98 10 10° 2767 10 190 
463 65 174 2 5557 2 215 
467 47, 97 10 = 2803 2 347, 87 
491 146, 169 10 5* O83 5 148, 382 
523 123, 200 10 r 5231 2 85, 228 
541 43 10 o° 9739 3 458 
547 35, 243 17 os 5471 3 477, 179 
557 11] 4] 10° 3343 10 222 
577 26 10 15 2309 2 556 
587 45, 46 
593 }] 10 Ys 1187 2 23 
607 296 575 2 3645 2 491 


617-10, 87, 169 
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It is a little curious that the smallest prime p satisfying the conditions in 
Theorem 1 gave for each I 


ind E,(d) # 0 (mod 2). 
We may now state 
THEOREM 2. 
s+ y" = 2" 
is tmpossible in non-zero rational integers x, y, and z if n is a given integer, 
2 <n < 617, excepting possibly 587. 
The above results have also been established for various prime exponents 


1 < 700 not included in the above. The computations are proceeding under 
other auspices. 


5. Application of the data obtained to other parts of the theory of cyclotomic 
fields. The numerical data obtained gives special information concerning many 
other questions in cyclotomic field theory. First, various results are known* 
concerning regular cyclotomic fields which have not been extended to other 
fields, and which have been applied to obtain results in Diophantine Analysis. 
By means of our work described here, we have isolated many regular cyclo- 
tomic fields in which all the results just referred to apply. 

Let us now consider primary units in a cyclotomic field. A unit » is primary 
in the field defined by a primitive /-th root of unity k(¢) if it is not the l-th 
power of an integer in k(¢) and if 


n=ea (mod r'), 


° ° ° os pe ° 9 ° . 
where a is an integer in k(¢) and = (1 — ¢). Iflis regular’ there is no primary 
unit in k(¢). If l is irregular, we may show that £,(¢) is primary provided 
B, = 0 (mod 1). For, we may set 


E,(¢) = E,(1) + aga — §)”. 


Then for an indeterminate w 


“ 2a , a > w — l 
(23) w E,(w)' ' = E,(1)"' + @(w)(1 — w)* + 3 w)(* ') 
v — 
since V(w) is some polynomial in w with rational integral coefficients. In this 
relation set w = e', take the logarithms of each member, differentiate * times, 
and set v = 0. Then" since for i # n 


2 , re Dien) 
(24) k an. Sa | =- L(y" BiG — yy, 
vm) 7 


dv ate | 2i 


* Hilbert, Werke, 1, pp. 278 312. Maillet, Annali di Mat., (3), vol. 12 (1906), pp 
145-178; Acta Math., vol. 24 (1901), pp. 247-256; Vandiver, Proc. Natl. Acad. Sei., vol. 17 
(1931), pp. 662-663; Monats. Math. u. Phys., vol. 43 (1936), pp. 317-320. 

® Hilbert, loc. eit., p. 287. 

© Vandiver, Transactions of the American Mathematical Society, vol. 31 (1929), pp 
619 620. 
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and fori = n 





[im ze] |. vr Be— De* = 
dv** _ 4n , 


using B,, 0 with 


dy2+1 | 


(‘— log =a] = 0, 
vel) 
we obtain 


Ee -e) 
dy 


|=0 (mod 1) (k = 1,2,---,l— 2). 


Hence 

aid — )*=xHa-H™ 
To prove that x(¢) is divisible by 1 — ¢ and hence that £,(¢) is primary we 
may write (23) in the form, d being some integer, 


1 


l 
(25) w' E,(w)'? = E,(1)' + x(w)(1 — w)* + Vi(w)(w' — 1) +d = “ 4 


In this relation set w = 1; this givesd = 0. In (25) set w = e’, take logarithms 
of each member, differentiate (1 — 1) times, and set v = 0, and we have, em- 


. v\l—-1 
ploying (24), since k v=) | = 0 (mod l) (s <1 — 1), 


dv* 
i—1 v vyl-1 
k x(e")(1 — e’) =0 (mod J), 
dy! val) 
and 
d' (] —_ e")' , 
(e' = 
E ¢') rm ie 0 (mod J), 
whence 
[x(e")|.—0 = 0 (mod 1), 
or 
x(e’) = (1 — e')w(e’) (mod 2). 


This gives the result. 

We may now show that £,(¢) is primary when B, 0 (mod lL) if we note 
also that #, is never the Lth power of a unit in k(¢) in any of the cases we 
have tested since in each such ease we found a prime ideal B such that /,(¢) 
is not congruent to the Lth power of an integer in k(¢) modulo B, where $B 
is a prime ideal divisor of p. Hence for any l where we have found £,(¢) 
primary we have explicitly determined an absolute (Hilbert) class field" of k(¢). 


4 Hilbert, loc. cit., pp. 149-156 
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We now consider the connection of our results with the second factor of the 
class number of k(¢). This factor may be written” 


Mine -** Ni, 
aor aan 
where A is the determinant 
bu ban bir 
bio ae bie | 


the b’s and n’s being rational integers not all zero such that 


hs b bes biis 
y: = ei!" €:° see er) (s = .. 2 eee v4), 
where 71, Y2, ---, ¥:, is a set of fundamental units of k(¢); «& is the unit ob- 


tained from our e¢, previously defined, by the substitution (¢/¢"'); lL = 
(lL — 3)/2. In a former paper,” the writer showed that for this second factor 
(say he) to be divisible by 1 it is necessary and sufficient that at least one of the 
units E,(¢) (¢ = 1, 2, --- , 4) be the -th power of a unit in A(f¢). In view of 
what we observed above, concerning the E’s not being congruent to the L-th 
power of an integer in k(¢) modulo %, we may state the 

TueoreM 3. If l is an odd prime and ¢ = e*"'', then the second factor of the 
class number of the field k(¢) ts prime to l for each | < 617 excepting possibly 587. 

If A is the class number of k(¢), where k(¢) is defined by an irregular prime, 
then we may write 

h = Ij, 

where j is prime tol. If we raise each ideal in k(¢) to the j-th power, then the 
ideal classes defined by these ideals will form a group called the irregular class 
group of k(¢). The prime ideal $, the factor of (p), which we have used in 
connection with each 2, which was primary, is such that 2 belongs to this 
irregular class group. This follows since E£,(¢) is not congruent to the L-th 
power of an integer in the field k(¢), and, since E,,(¢) is primary, this cannot hold 
if ® does not belong to the irregular class group. That is, B* is not a principal 
ideal unless k is divisible by l. For example, turning to the table we see that 
for | = 307 the prime ideal 8, a prime ideal divisor of 1229, belongs to the 
irregular class group, ete. Specifically, such ideals can be written in the form 


and since they are not principal, they cannot be further reduced. 


Report on the Theory of Algebraic Numbers, Bull. Nat. Res. Coun., No. 2, February, 
1928, pp. 34 and 38, with references to Kummer there given; Fueter, Synthelische Zahlen 
theorie, Berlin and Leipzig, 1925, 2d ed., p. 228 

' Proc, Nath Acad. Sei., vol. 16 (1980), pp. 743 749, 
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In connection with some previous work on Fermat’s Last Theorem we em- 
ployed for particular exponents the following (N. A., p. 670) 

THeoreM 4. Under the following assumptions 

(1) the second factor of the class number of the field k(¢) is prime to 1; 

(2) none of the Bernoulli numbers B,, (n = 1, 2, --- ), (lL — 3)/2, is divisible 
by P, 
the equation (1) is impossible in rational non-zero integers x, y, and z. 

The above theorem was applied to prove Fermat’s Last Theorem for all 
primes | < 307. Weare enabled to remove the restriction to case II mentioned 
in the theorem in N. A. because of the results in another paper of the writer’s.”* 
The question involved in the second assumption in the theorem is intimately 
connected with other questions concerning the divisibility of certain other 
Bernoulli numbers by . We have the known relation 


Bi. .o, — 2B'.., + Bi, =0 (mod P); pz = (l — 1)/2, 
where 
Bi = Ss By 


From this we obtain easily by induction” 
(26) Bi. = yB... — (y — 1B. (mod [’). 
If we now select | and n so that B2 = 0 (mod l) with as - B. (mod I’), 
it follows that there exists an integer y which yields B’.,,, = 0 (mod). Pol- 
laczek took 1 = 37, n = 16, which gave y = 7; hence Bi = 0 (mod 37’). He 
also found two other examples of this type. Hence the numerator of a Ber- 
noulli number may be divisible by the square of a proper divisor, where a proper 
divisor of the numerator of a Bernoulli number B; is one which is prime to 7. 
Pollaczek showed that a necessary condition that k(¢) contain an ideal be- 
longing to the exponent [/ was that one of the (lL — 3)/2 Bernoulli numbers 
Biursyr (t = 1, 3,---,l 4) be divisible by /. In view of the above, we 
consider the number of Bernoulli numbers in the set 


Mn Metas.*** «5 Mavi 


which are divisible by [. The relation (26) shows that if two of these numbers 
are divisible by &, then all of them are. Here we are assuming B, = 0 (mod lL). 
The work of Pollaezek mentioned above for the case | = 37 shows that 


Bai, 4 B, (mod [’). 

Henee it follows that, nce 
Bia = 0 (mod 37°), 
7 f~ 0 (mod ’) 


* bulletin of the American Mathematical Society, vol. 40 (1934), p. 118, Theorem 1 
* Polluezek, Math. Zeitschrift, vol. 21 (1924), p. 36 
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or 

Bi, # 0 (mod I’) 
for n = 16,1 = 37 likewise 

Buy F 0 (mod [*). 
Here we note that 

= (2n — Dl + 1 
2 

and if 

Bui, = 0 (mod [°), 


we may have an ideal k(¢) belonging to the exponent [. Hence Pollaczek’s 
computations verify independently for | = 37 that the class number of &(¢) is 
divisible by 1 but not by F and also that 


Nie.sr # 0 (mod 37°). 


His work (loc. cit.) in connection with the primes 59 and 67 furnishes similar 
checks. 

The data obtained concerning the examination of primes as to regularity 
furnishes much other information concerning the divisibility properties of the 
Bernoulli numbers. For example, we may find the least positive residue of the 
numerator of any Bernoulli number modulo J, where / is any prime < 619. 
For, in each case we have computed the left-hand members of one of the for- 
mulas (14), (15), (16), or (18), finding the least positive residue modulo / for 
the p used in these formulas. Thus in (14) we may conveniently reduce the 
factor on the right not involving B, employing Jacobi’s table of indices and 
also using the explicit known form of the denominator of B,. Fora > (J — 3) 2 
we employ the known formula 


= & B 
(— 1) o—* = ~ (mod l) 
a—p a 
except when a is a multiple of ww = (J — 1) 2. In the latter case we have 
pBy, = (mod 1) 


which enables us to reduce the numerator of B,, , modulo / 

We may note that arithmetical machines were not employed in the computa- 
tions described in this paper except in connection with addition. — In my opimion, 
there are probably many other types of number theoretic computation in which, 
in the future, it will be very convenient to employ, if they exist, tables of indices 
for primes beyond 1000.) In connection with the second type of computation 
involving the treatment of irregular exponents in (1) we constructed special 
tables of indices as previously described, ‘That is, for the value of p which we 
used, we set up partial companion tables from which we ean quickly obtaim 
the smatlest residue of any integer raised to any power modulo p and having 
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given an integer we can find what power of p is congruent to that integer modulo 
p where p is a primitive root of p. For the primes < 211 these companion 
tables are complete for each p selected in connection with each irregular prime l. 
For the convenience of anyone who may wish to make use of these particular 
tables, I shall list values of p > 1000 that we employed in this connection. 

1187, 1229, 1543, 1579, 1627, 1637, 1699, 1733, 1867, 2083, 2309, 2767, 2803, 
3209, 3343, 3643, 4211, 4549, 4943, 5231, 5471, 5557, 9739. 

All the numerical data obtained in the course of this work will ultimately be 
deposited in the library of the American Mathematical Society. 


University oF Texas. 
NOTE 
Since the above was written, Fermat’s Last Theorem has been proved for the 


case | = SST. Hence, in view of Theorem 2, the theorem is true for all prime 


exponents less than 617. 











ON THE NECESSARY CONDITIONS FOR THE MINIMUM OF A 
DOUBLE INTEGRAL 


By Max Cora. 


1. Introduction. The purpose of the present paper is to exhibit an intimate 
connection which exists between the theory of the calculus of variations for a 
double integral and the corresponding theory for a simple integral. The varia- 
tion problem which will be discussed is that of minimizing an integral 


(1.1) ff foxs ys sup +525 e0n2tey o++ 5 Z0es By 22+ Bededy 
. i 


in a certain class of sets of functions z,;(2, y) (@ = 1, 2, --- , ) all of which take 
on the same values on the boundary of the region A of integration. When it is 
not assumed that the minimizing set Z;(r7, y) (@ = 1, 2, --- . nm) have continuous 
partial derivatives of order greater than the first, the differential equations 
which must be satisfied by the minimizing set were first derived (at least for 
the case n = 1) by Haar [1],' [2], who made use of his so-called Fundamental 
Lemma of the calculus of variations for double integrals. A survey of the 
literature concerning this Fundamental Lemma will be found in the Chicago 
dissertation of Miss Huke [8]; the proof of the lemma was simplified considerably 
by Haar in his last paper [4] on the subject. 

Of the further necessary conditions for a minimum of the integral (1.1), the 
analogue for double integrals of the Legendre condition for the minimum of a 
simple integral was first established by Mason [5] for the case » 1. The 
analogue of the Weierstrass condition was first proved by FE. ke. Levi (6) for the 
Case n Land by MeShane [7] for the general ease. We shall not be concerned 
in this paper with the analogues of the Jacobi condition. 

The Fundamental Lemma of Haar is unnecessary for the development of the 
theory of the caleulus of variations for the integral (1.1) and the well-known 
Du Bois-Reymond lemma of the theory for simple integrals suffices. Indeed, 
it will be shown below that there is associated with the problem for the integral 
(1.1) an auxihary minimum problem fora simple integral and that the necessary 
conditions of Haar, Weierstrass, and Legendre for the problem involving the 
integral (1.1) can be respectively deduced as simple corollaries of the necessary 
conditions of Muler, Weierstrass, and Legendre for the auxiliary problem. The 
condition of Haar will be derived below in a moditied form involving integral 
equations 


Received May 24, 10387.) The results contained in this paper were obtaimed in part by 
the author during his tenure of a National Researeh Fellowship 
' Numbers in square brackets refer to the bibliography at the end of the paper 
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2. Hypotheses. Let A be a region of points (z, y) and let 
(2.1) zi = Z,(x, y) [(z, y) in A;t = 1,2,---,nJ 


be functions which are continuous and which have continuous partial deriva- 
tives of the first order in A. The function f(z, y, z, p, q) will be supposed to be 
defined and continuous for sets 


(x, y, 2, p,q) = (2, H, Bas <** p Bas Mae *** sar Ms *** >) 


in a neighborhood G of the sets (2, y, Z, Zz, Z,) belonging to (2.1), and in G 
the function f shall have continuous derivatives of the first and second orders 
with respect to the arguments z;, pi, gi (@ = 1, 2, ---,n). 

We shall denote by MN the class of sets of functions 


(2.2) 2; = z,(z, y) [(xz, y) in A;t = 1,2,---,n] 


which have the following properties: 

(a) the functions z;(z, y) are continuous in A and the region A may be decom- 
posed into a finite number of subregions, each bounded by a simply closed 
regular curve, such that on each subregion the functions z,(z, y) have continuous 
partial derivatives of the first order; 

(b) the elements (x, y, z, z., 2,) belonging to a set z,(z, y) are all in G; 

(c) the functions z,(z, y) take on the same values on the boundary of A as do 
the functions of the set (2.1). 

Under these hypotheses the integral 


- | [ se, Y, 2, Zz, 2,)dxdy 
4 


has a finite real value when computed for any set z(z, y) of the class Dt. We 
shal] suppose that the set (2.1) furnishes a minimum to the integral J in the 
class Wt. The necessary conditions of Haar, Weierstrass, and Legendre which 
must then be satisfied by the minimizing set (2.1) will be derived below by 
consideration of an auxiliary minimum problem for a simple integral. 


3. The auxiliary minimum problem. Consider any point (x29, yo) interior 
to A and let To represent a ring of points (x, y) defined by the equations 


(3.1) L Ly + pcos by yo tpsind (O<nS&psr;0 508 2m), 


where ris chosen so small that [ lies interior to A. Let 70) be any function 
which is continuous on O = 6 = 2r and such that 7'(0) T(2r) and which 
has the further property that the interval 0 < @ < 2m may be decomposed into 
a finite nurober of subintervals on-ceach of whieh 70) has a continuous first 
derivative, and let I represent the class of sets of functions Ri(p) (mn S p S 7; 
i ve n) which with their derivatives Ri(p) are continuous on 7; 

po ~ rv and have all their elements [p, R(p), R’'(p)| sufficiently near the sets 
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(op, 0, 0) and for which Ri(m) = Rr) = 0 (i = 1, 2,---,m). For any set 
R; of N put 
(3.2) ¢: = Zilxe + p cos 0, yo + p sin 0) + Ri(p)T(@) 


(1SpS7r,05 0608 2n;i = 1,2,---,n). 
Then in T we have 


¢ = Zi. + RT cos 6 — R;T' (sin 6/p), 
(3.3) (i = 1,2, ---,n). 


a. , ales 
> t& = Zi, + Ry T sin @ + RT’ (cos 6p), 
y 
If the definition of the functions ¢; is extended so that ¢; = Z; in A — T 
(¢ = 1, 2,---,m) the resulting set of functions clearly belongs to WM. With 


7(6) held fixed, f(z» + p cos 0, yo + p sin 6, ¢, ¢, ¢,) becomes a function of 
(p, 0, R, R’) as is clear from (3.2) and (3.3). If now we put 


(3.4) g(p, R, R’) = [ S (xo + p cos 8, yo + p sin 6, &, f., Fy) ae, 
0 


then g(e, R, R’) is defined for all R and R’ sufficiently small and for all p such 
that the circle x = x9 + pcos 6, y = yo + psin @liesin A. Then the minimizing 
property of the set (2.1) implies that the set Rip) = 0 (1 S p sr; 
i= 1,2, .-- , n) furnishes a minimum to the integral” 


J = [ gle, R(p), R'(p)| dp 


in the class N. 


4. The necessary condition of Haar. A first necessary condition which must 
be satisfied by the minimizing set (2.1) is contained in the following theorem: 

Turorem |. For every simply closed regular curve C lying interior to A the 
equations 


(4.1) | fo.y — f,,dr = | | f.dedy (i = 1,2,---,n) 


are satisfied, the arguments of f,, ,f,, and f., being the set (x, y. Z, Zz, Zy) belonging 
to the minimizing set (2.1). 

The condition (4.1), while not in the form given by Haar, is equivalent to his 
system of differential equations. The theorem will be proved first for the case 
when C is a cirele. Let C have its center at (20. yo) and radius r. Let ry be 
any positive number less than rand consider the ring [ with center at (ro. Yo) 
and radii ry, and vr. For the corresponding auxiliary minimum problem, con- 

2 The integrand g(p, BR, BR’) possesses all the continuity properties needed to establish 
the first three necessary conditions for the auxiliary minimum problem. See Carathéodory, 
Variationsrechnung und partielle Differentialgleichungen, Leiprig, 1935, p. 190, footnote 
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structed as described in the preceding section, the minimizing set R,(p) = 0 
(1. S pS r;i = 1, 2,---,n) must satisfy the following equations, which are 
the Du Bois-Reymond form of the Euler equations for the problem: 


gri(r, 0, 0) = / Jr, (p, 0, 0) dp + gei(ri, 0, 0) (i = 1, 2, ey n) 


or, by virtue of (3.4), with 7(6) 


1, 
(4.2) [ [(f,, cos @ + f,, sin @)p)f—", dé = [ f:,odpd@ (¢=1,2,---,n), 
0 ri 0 


the arguments of f,, , fo, , fz; in (4.2) being the set [x, y, Z(x, y), Z.(x, y), Z,(x, yy! 
with x and y replaced by the values given in (3.1). Taking the limit as ri 
approaches zero in (4.2) one secures the desired equations (4.1) for the case of 
the circle C. 

Now let C be any simply closed regular curve interior to A. Consider a 
closed region A’ interior to A and containing C in its interior, and let r > 0 
represent the minimum distance between the boundaries of A and A’. If 
C.,,. is a circle with center at (x, y) and radius p, then the functions 


M;,(2, y) _ / f,, dé dn, 


II S,, dé dn, (i = 1, 2, seen) 
If f., dé dn, 


in which the arguments of f,, , f,, and fz, are the set [r + & y + n, Z(x + &, 
y +n), Z(xr + ty +n), Z,(x + &, y + n)], have the following properties: 
(a) for each value of p on the interval 0 < p S r the functions (4.3) are single- 
valued and continuous for (z, y) in A’; 
(b) for each value of pon 0 < p S r the functions (4.3) have continuous first 
partial derivatives with respect to x and y for (x, y) interior to A’; these deriva- 
tives are given by 


Il 


(4.3) N,,(z, y) 


P(x, y) 


! So dn, 
ars (i = 1,2,---,n) 


-| So: dé, 


with similar formulas for the partial derivatives of N,, and P,,; 


Il 


2 M.,(z, y) 
or 
(4.4) 


II 


be M,,(2, y) 
Oy 


* Property (c) is readily proved by means of the mean value theorem. For a proof of 
the analogue of property (b) for triple integrals see O. D. Kellogg, Foundations of Potential 
Theory, Berlin, 1929, p. 224. 








of 
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(c) the functions (4.3) satisfy the conditions 
lim (1/xp*) M;,(z, y) = fp, [x, y, Z(2, y), Zz, y), Z,(z, yl, 


e-0 


lim (1, rp) N;,(z, y) = Sait, Y; Z(x, y)) ZAx, y), Z,(z, y)], (z = 1, 2, C1O%y n) 
p70 
lim (1) mp) Pi,(x,y) = f.,[x, y, Z(x, y), Ze(z, y), Z,(z, yl, 


uniformly for (x, y) in A’. 
Consider now any value p of the interval 0 < P <r. Since for each point 
(x, y) of C the equations (4.1) hold on the circle C,,,, , we have 


(1, LLY f. fu, dda) de dy = (1, | [( 4 Jp,dn — So; de) drdy 
= (1, wf [[ mate y) 
ceLor 


Pan 
+ ay N,,(z, ») Jardy 


= (1/mp’) [ M,,(x, y) dy — Ni,(x, y) dz 


(¢ = 1,2, .--,n), 
by virtue of Green’s Theorem and property (b) above. Using property (c) we 


secure the equations (4.1) by taking the limit as p approaches zero. 


5. The necessary condition of Weierstrass. The Weierstrass E-function is 
defined to be 


E(x, y, z, p, a, P, Q) = f(z, y, z, P, Q) — f(x, y, z, p,q) 
— (Pi — pdfr.(z, ¥ 2,2, D — Qi — Whale, yz P.O, 


the repeated subscript ¢ indicating summation with respect to 7¢ over the 
range 7 = 1, 2,---,n. We have then the following result: 
THEOREM 2. At each point (x9 , yo) of A the inequality 


(5.1) Elxo , yo, Z(t0, yo), Zz(Xo , Yo), Zy(X0, Yo), P, Q] = O 
holds for every set (P, Q) for which [xo , yo , Z(x0 , yo), P, Q| ts in G and for which 


the matrix 
(5.2) 
has rank one. 


Since the matrix (5.2) has rank one there exists a pair of constants (a, 6) # 
(0, 0) such that 


II P; — Z ix(2o ’ Yo), Q; os Z iy (Xo ’ Yo) 1} 


(5.3) alP; — Zire, yo)| + IQ: — Ziy(xo, yo] = 0 (§=1,2,.--,n). 
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Let a (0 < @ S 2x) be such that sina = a/(a’ + b*)! and cosa = —b/(a’ + b*)! 
and determine constants Rj (i = 1, 2, ---,n) by the equations 


(5.4) Ri = [P; — Zizlto. yo)] cos a + [Qi — Ziy(ro, yo)| sin @ 


It follows from (5.3) and (5.4) that 
Ri cos @ = P; — Zi(x0, yo), 
Ri sin a = Q: — Ziy(2o, yo), 
The desired inequality (5.1) now becomes 
(5.5) E(xo. yo. Zo. Zor, Zo, Zor + R’ cos a, Zy, + RK’ sin a) = 0, 
where we have put Z» = Z,(20, yo), Zior = Zizlto, Yo), Zinoy = Ziy(Xo, Yo) 


(«= 1,2,---,n). 

Now consider again the auxiliary minimum problem constructed in $3. The 
minimizing set R,(p) = O(m S p S r;t = 1,2, --- , n) must satisfy the Weier- 
strass necessary condition for that problem. Hence for p = 7, 


5.6) g(r, , 0, R’) — g(r,, 0,0) — Rign:(rr, 0, 0) = 0. 
After division by r; and the use of (3.4) and (3.3) we secure, by letting 7 ap- 
proach zero, 


(2 


5.7) | Eloy yor Zoy Zoey Loy Boe + R'TO) cos 0, Zo, + R'T(O) sin 0]d0 = 0. 


Suppose that the inequality (5.5) is false. Then there exists an interval 


‘ 


0 “a £6 = a < 2x to which @ is interior, such that 


(5.8) Elto, yo. Zo. Boe. By. Sor + RR’ cos 0, Zy, + R’ sin 0) < 0 


(a, S 0 S ag). 


(Here and in what follows the modifications which are necessary in case a = 0 
ora = 22 will be obvious to the reader.) Choose a positive number eso small 
that a, e and a + eure interior toO < 6 < 2m and define 7',(0) as follows: 
‘(e+ 0 )/¢ (ex «¢€S5 0S a), 

(5.9) 1%) 4] (a, <— 0 ay), 
(¢ 4 + «m)/e (a2 = 0 < ag + 6), 


and 7,,(9) QV elsewhere on O © 6 © 2m. For this funetion 7.00) the integral 
7 


iy 5, thee ‘ 
Be a4) Das 


(5.1, [ Khe, yo, Bo, Bor, Boy, Bar 4 UV AO) ©0% 0, Zo, 4 RTO) sin Odd, 
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and this integral may be broken up as follows: 


(5.11) im Bae = [ Ede + | Ede + | Bde, 


the arguments of E being the same as in (5.10). 

The second term on the right of (5.11) is negative by virtue of (5.8) and 
dominates the right side of (5.11) for sufficiently small ¢«, since the first and 
third terms clearly approach zero with «. For our present choice of 7(@), then, 
we have a contradiction with the inequality (5.7), which must hold for all fune- 
tions 7'(@) of the type described in §3. Hence the inequality (5.5) is true. 


6. The Legendre condition. The necessary condition of Legendre for the 
minimizing set (2.1) ean be secured by well known means out of the Weierstrass 
condition. We shall derive it here, however, from a consideration of the 
auxiliary minimum problem. 

THEOREM 3. At cach point (x9 , yo) of A the inequality 


(6.1) Soimg Ai An + pig Ac Be + fo.q,.Bi Bi = 0 
holds for every set (A, B) for which the matrix 
(6.2) A; ‘ B; j 


has rank one. In the left member of the inequality (6.1) the arquments of the 
derivatives of f are the set [xo . Yo, Z(t0. Yo), ZAte. Yo), Zy(Lo. Yo)| belonging to the 
minimizing set (2.1). 

As in the previous proof, determine a pair of constants (a, b) = (0, 0) as solu- 
tions of the equations 


(6.3) aA; + bB; = 0 (¢ = 1,2, ---,n) 
and define a@ by the equations 

sina = a/(a’ + B*)', cosa = —b/(a* + b*)', (0 S a S 2r). 
Determine constants Cy (7 1, 2, ---,m) by the equations 
(6.4) Cc, A; cos a + B, sin a (a ca Jn) 


Then in consequence of (6.8) and (6.4) we have 


C, cor a A;, 
\t l 2, n) 
Cc, sin a B, ‘ 
and the inequality (6.1) becomes 
(6.5) = cos «a t 2f,.g, COS a sina + | sin’ WMC Cy, 2 0 


Suppose the mequality. (6.5) is false. An argument similar to that im the 
preceding seetion would lead to the existence of an interval a, S @ Say interior 
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to0 S 6 S 2rand toa function 7,(8), defined as in (5.9) for ¢ sufficiently small, 
such that 


(6.6) [ (focp, COS O + 2fy,q, COS O Sin 6 + fy, sin’ 6) T.(0)” C;C,. dd < 0, 
0 


the arguments of the derivatives of f being the set 
[x0 , yo, Z(Xo, Yo), Zz(Xo , Yo), Zy(To , yo)]. 


On the other hand, the Legendre condition for the auxiliary minimum problem, 
which must be satisfied along the minimizing set Ri(p) = 0 (n S p S 7; 
i = 1, 2, --- ,m), requires that 


Jrini(ti, 0, 0)C;C, = 0. 


If this inequality is expressed in terms of the derivatives of the function f by 
means of (3.4), and the result is divided by 7 , the inequality which results in 
the limit as 7; approaches zero is found to be in contradiction with (6.6). Hence 
the supposition that (6.5) is false is incorrect. 


BIBLIOGRAPHY 


1. A. Haar, Uber die Variation der Doppelintegrale, Journal fiir die reine und angewandte 
Mathematik, vol. 149 (1919), p. 1. 

2. A. Haar, Uber eine Verallgemeinerung des Du Bois-Reymondschen Lemmas, Acta Lit- 
terarum ac Scientiarum, vol. 1 (1922), p. 33. 

3. A. Huxe, An historical and critical study of the fundamental lemma of the calculus of 
variations, in Contributions to the Calculus of Variations, 1930, Chicago, 1931, pp. 
45-160. 

4. A. Haar, Zur Variationsrechnung, Abhandlungen aus dem Mathematischen Seminar des 
Hamburgischen Univ., vol. 8 (1930), p. 1. 

5. M. Mason, A necessary condition for an extremum of a double integral, Bulletin of the 
American Mathematical Society, vol. 13 (1907), p. 293. 

6. k. FE. Levi, Sulla necessita della condizione di Weierstrass per Uestremo degli integrali 
doppi, R. Accademia dei Lincei, Atti, vol. 24 (1915), p. 353. 

7. KE. J. MeShane, On the necessary condition of Weierstrass in the multiple integral problem 
of the calculus of variations, Annals of Mathematies, vol. 32 (1931), p. 578 and p. 723. 


WayNeE UNIVERSITY. 








CONCERNING APPELL SETS AND ASSOCIATED LINEAR 
FUNCTIONAL EQUATIONS 


By I. M. SHEFFER 


Introduction. We have elsewhere considered the linear difference equation 
with constant coefficients’ 


k 
(1) YX aiy(x + w) = FO), 
= 
from the point of view of a local solution. That is, assuming only that F(z) 
is analytic about a point, we have shown that y(r) exists satisfying (1) in the 
neighborhood of this point. Now (1) is a particular case of the general linear 
differential equation of infinite order* 


(2) Lly(z)] = De anys) = F(z), 
where if we set 
(3) L® ~ D a,t" 


and call L(t) the generating function for the operator L[y], then the generating 
function for (1) is 


k 
LO = Dd aje**'. 
j=l 

This suggests the possibility of developing a local theory for equation (2), at 
least when L(t) is suitably restricted. 

The solubility of equation (2) is linked with the problem of expanding F(r) 
in a series of Appell polynomials {P,(r)} generated by L(é); i.e., where the 
sequence {P,(x)} is defined by 


(4) Le” ~ Dd P(t". 


n=O 


We see this formally from the fact that 


L [: = P,(x), 
nm. 


Received June 1, 1937; presented to the American Mathematical Society, April, 1937. 

' Sheffer, Transactions of the American Mathematical Society, vol. 39 (1936), pp. 345 
379, and vol. 41 (1937), pp. 1538-159. 

? For an investigation of equation (2) from another point of view see H. T. Davis, 
American Journal of Mathematics, vol. 52 (1930), pp. 97-108. 
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and therefore if F(x) has the expansion 
(6) F(z) = Dien P,(2), 


a formal solution of (2) is given by 
(7) yz) = Dew. 
n. 


We are thus led to examine those functions L(t) whose Appell polynomials can 
be used to expand the general analytic function. 

For this purpose the following classification of functions L(t) seems to be 
significant: 

(i) L(t) ~ ym a,t" has a zero radius of convergence. 

(ii) L(t) = > a,t” has a finite, non-zero radius of convergence. 

(iii) L(t) is an entire function, not of finite exponential type.’ 

(iv) L(t) is of finite exponential type. 

The functions of classes (i) to (iii) appear to be inadequate to expand the 
general analytic function; we accordingly restrict our attention to functions of 
class (iv). We shall not prove the inadequacy of classes‘ (i) to (iii); rather, we 
shall give some examples to suggest the truth of the statement. 

Suppose L(t) = 1/(t — a) (a # 0), so that L is of class (ii). One finds from 
(4) that 


1} 2’ a gle 1 
PJs) = - | + + +--+ i 


a\n! (n — l)!a (n — 2)! a? a” 

or: 
] (ax) ™ (ax)" 
Pte = Sat 0! 1! n! 
a” e™ 2 
Now the expression in brackets can be written 1 — r,(7), where 
ze) = 6 (azx)"*' [ ‘ (ax) 1, (ar)” 1 vf 

(n + 1)! n+2 (n + 2)(n + 3) 

and, 


r(x) < 2e"'“CalR)""'/n +1)! (n= Ne sufficiently large), 


where ¢ <= Rk. Again, 


> C, P.(2z) eo > Ca “i > role) 


an?! ant! 


L(@) > a, is of finite exp. type pif lim sup) nta, '/* = p 
neh 


‘Jt would be of interest to have classes G) to Gii) investigated, in order to determine 
the set of functions possessing a convergent expansion in the Appell polynomials generated 


by mesbers of these classes 
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We see, therefore, that > c, P,(x) converges for at least one value of x if and 
only if > c,/a" converges; and if convergent for one z it is convergent for all x. 
That is, when the series converges the sum is an entire function. Even so, not 
all entire functions can be expanded in this manner. We are thus far from 
having an expansion for the general analytic function. 

As a second example, choose L(t) = e’'* which is of class (iii). The asso- 
ciated Appell set is the set of Hermite polynomials, and for these it is known 
that the expansion converges in a horizontal strip of which the real axis is the 
bisector. Here again the general analytic function possesses no (convergent) 
expansion. 

The generating functions L(t) of class (iv) come closer to our aim; and it is 
the Appell sets generated by this class that we study in the present paper. 
In Part I we determine the character of the regions of convergence of Appell 
expansions. They are bounded by simple closed convex curves, and their 
position depends on the singularities of a function related to L(é). In Part I 
we obtain a solution of equation (2); not, indeed, for all analytic functions, but 
rather for all functions that have a radius of convergence exceeding a suitably 
chosen number (depending on L(t)). This then permits us to find a class of 
functions possessing a convergent Appell expansion. 


Part I. Convergence regions for series of Appell polynomials of class (iv) 
Let 


(1.1) LO = > a,t" 


be an entire function of finite exponential type (see footnote in Introduction), 
and let {P,(7)} be the set of Appell polynomials generated by L: 


(1.2) e“ LO) = > Pit". 


Definition. Let h(@) = DCe,t" be of exp. type o < «©. Then the series 
H(t) = Do nte,t" has a radius of convergence 1/o, and we say that A(t) ts the 
Borel entire function associated with H(t). We shall denote this relation by 
h(t) = BEF {H()}. 

LemMa 1.1. Forn = 1, 


— 1") 
‘ ml te * 3K ’ (n . 
(1.3) ier BEF { (l—tr { 


The proof is straightforward, and need not be set down here. 
= 


Now e“ L(t) = & a,(e“t"), so that formally we have 


e (2 1)" ) 
(1.4) > Pict = BEF {> a." __} 
; a tr 
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Relation (1.4) is however more than formal; it is valid for z in any bounded 
region R and ¢ sufficiently near to the origin (how near depending on R). For 
the expression in braces is uniformly convergent in z and ¢ for z in R and | t | 
sufficiently small, and may therefore be expanded in a convergent power series 
in t by Weierstrass’ theorem. On doing so one finds for the coefficient of t” 
the expression 


ax" + naz” + n(n — 1l)aez”™” + --- + nian, 


so that (1.4) holds, since from (1.2) we obtain 


n—l 


5 net eee tee a 
(1.5) P,(x) = ao | + a in = + + an. 


If in (1.4) we replace t by 1/t and divide through by t, we obtain 
THEOREM 1.1. Let L(t) of (1.1) be of exponential type a < ~. Then 


< n! — n!P,(r) 
ue a eal — x)" i. X pnt 
whenever both series converge. The right-hand series converges uniformly in x and t 
for z in any bounded region Rand tin’\t| =p > «+d, where d is the maximum 
value (or least upper bound) of x in R. The left-hand series converges uniformly 
for zandtsuchthat t—z 21> ca. For zinany bounded region, (1.6) holds 
forall t sufficiently large. 

Consider now the function L*(t) of which the BEF is L(t): 


(1.7) L*(t) = po n'a,t". 
0 
Then 
] nla, 
(18) (7) = 


and on comparing with (1.6), 


(1.9) = 1 ( )- =. 
— rt 


, 2 {"*! 


valid for ¢ in any bounded region and | ¢ sufficiently large. 

equation (1.9) is an important relation, for by means of it we can acquire 
information concerning the behavior of | n!P,(r) |" for large n. This is a 
consequence of the simple observation that the cirele of convergence for the 
right-hand member of (1.9) is determined by the most distant singularity, of 
the left-hand member, from the origin. 

Define Alu) by 


(1.10) A(u) uL*(u), 


4h) imofexp type fta4+ oz 





i ed 


a} 


if 
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and suppose for the moment that A(u) is single-valued throughout its domain 


of existence. Let G = {a} be the set of all the singularities (including ~ if 
necessary) of A(u), and E = {8} be the set of points defined by 
(1.11) 8B = —l/a 


as aruns through G. To each a (in G) corresponds a singularity ¢ = zr + a = 
x — B of (1.9), and conversely. If then we define D(x) by* 


(1.12) D(x) = max F + 4 = max|r— 8B, 
Q 


we have 
THEOREM 1.2. For every z, 


(1.13) lim sup | n!P,(x) |"" = D(z). 
n= 
This theorem is also true when A(u) is a branch of a multiple-valued func- 
tion, but in this case it is necessary to make precise those singularities of the 


complete analytic function “A(u)” (as we shall denote it) that are to be regarded 
as singularities of the particular branch A(u) given by 


(1.14) A(u) = } nta.u™™? 


From (1.4) we have 


(1.15) A( : ) = Cpe, 


l1—tr 


valid for x in any bounded region on taking | ¢| sufficiently small. Let r = 


be” = w + in be fixed, and consider the transformation 
t u 
1.16) “u= t= : 
( 1 —é&’ l+ur 
When ¢ describes the cirele | ¢| = r, u describes a cirele C, with the following 


radius and center: 
R=r|re—1{'; (—wr?/(r°8* — 1), or /(r°S* — 1)). 


Let B be the point —w + ¢», and let J be the ray issuing from O (the origin) 
through B, and U’ the ray in the opposite direction. Three cases are to be 
distinguished: 

(i) r < 1/8: the center of C, is on ray I’. 

(ii) r > 1/6: the center of C, is on ray L. 

(iii) r = 1/6: C, is a straight line (hereafter denoted by L,) cutting / at right 
angles at a distance 1/25 from O. The distance from the center of C, to O is 
ré|re—1i. 


* Point @ runs over G and 6 over EB. The set & is clearly bounded and closed, so that 
max | A} exists 
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Let I, denote the region in the u-plane into which |¢| < r maps under the 
transformation (1.16). It is then found that, according as we have cases (i), 
(ii) or (iii), J, is respectively the interior of C, , the exterior of C, , the half-plane 
(determined by L,) that contains ray Il’. In every case the origin (O) is in TJ, . 

In case (i), R regarded as a function of r is an increasing function, varying 
from 0 atr = 0 to ~ asr— 1/6. On the other hand, for case (ii), the radius 
decreases to 0 as r — ~, the limiting center being the point H: (—w + in)/® 
(on l). 

Let r increase from 0 to x. From 0 to 1/6 the transformed curves are circles, 
C,, inside C,, for r; < re, all lying on the origin side of L, , and filling out this 
half-plane. For r = 1/6 we get the line L,. Then for r from 1/6 to « the 
circles form a decreasing set, C,, surrounding C,, for r; < rz, all lying on the 
other side of L,, and shrinking down to the point H as r — «; and they fill 
out this other half-plane. 

Now consider the complete analytic function “A(u)”. For the given fixed z 
there is a smallest r = mr, (> 0) such that C,, passes through a singularity’ 
“ee” of “A(u)”; (i.e., at least one branch of “A” is singular at “a’’). We are 
to decide whether or not ‘“a’’ is to be regarded as a singularity of A(u) relative” 
to the fired x. What we mean by this phrase will appear shortly. We consider 
three cases. 

(i) 7, < 1/6. It is clear that series (1.14) can be continued analytically 
throughout J,, , so that branch A(w) is single-valued and analytic in J,,. If 
it is possible to continue this branch beyond C,, across “a’’; i.e., if there is a 
circle containing “‘@’’ and therefore overlapping with J,, , such that series (1.14) 
can be continued from J,, into this circle; then “a” is not to be regarded as a 
singularity of A(u) (relative to z). We shall say that the singularity “a’’ of 
“A” is passed over. If there is at least one singularity “a@’’ on C,, that cannot 
be passed over, such a point is a singularity of A(u) (relative to x), and the 
radius of convergence of (1.15) is then precisely r;. If all singularities “a’’ 
on C,, are passed over, there will nevertheless exist a smallest value r = re (> 0) 
such that on C,, there is at least one “a’’ that cannot be passed over. Such an 
“a” is a singularity of the branch A(u) (relative to z), and series (1.15) has the 
radius of convergence re. 

(ii) 7, > 1/6. Here series (1.14) can be continued from O throughout the 
whole exterior of C,, (including ~ ), and in this region J,, this branch is single- 
valued and analytic. The argument now follows the lines of case (i). 

(ili) 7, = 1/6. C,, is now the line L,,. If “a” is a finite singularity (of 
“A”) on L,,, the method of case (i) tells us whether or not “a” is a singularity 
of branch A(u) (relative to z). If there is no finite “a” on L,, , or if every such 
finite “‘a’”’ is passed over, there is yet the possibility that “a” = «. (If eisa 


7 We can ignore those singularities “a@’’ of “A’’ that lie interior to the circle of con- 
vergence of (1.14), since these cannot be singularities of A(u). Hence r; > 0. 
) 


* We shal! presently see that if ‘a’ 


is regarded as « singularity relative to one z then it 
ean be regarded as a singular point relative to all 2; i.e., it is independent of x. 
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singularity of “‘A”’ we consider it to lie on L,,.) It may not be possible to 
determine directly if branch A(u) can be continued across “x”. We then 
proceed as follows: 

If there exists an r > 1/6 such that A(u) is single-valued and analytic in I, 
(and therefore at «), then “©” is passed over. But if for every r > 1/4, 
A(u) does not remain single-valued and analytic in J, , then © is a singularity 
of A(u) (relative to 2). 

To sum up: For each fixed x there is as described above a smallest positive 
number r = r, such that the branch A(u) given by (1.14) can be continued 
analytically (and single-valuedly) throughout J,, but not throughout J, for any 
r>r,z. OnC,, there is at least one singularity “a” of “A” that is a singu- 
larity of A(u) (relative to x). The radius of convergence of series (1.15) is r. . 

Let G, be the set of those singularities of “‘A’’ on C,, that are not passed over; 
i.e., that are to be regarded as singularities of A(u) (rel. to x). And let H, 
be the set of those “‘a’s’’ lying in and on C,, that are passed over, and E, the 
set related to G, by the transformation (1.11). 

As x varies these sets can vary in their membership. Denote by G and E 
the respective logical sums of all the sets G, and all the sets E,. It is then seen 
that Theorem 1.2 is valid with this choice of G and E provided we show that a 
point “a’’ which is a singularity of A(u) relative to an x = 2 can never be a 
passed-over point relative to some other r = 22 ; i.e., that the logical product 
G,,-H,, is’ null for all x; and ze. 

Suppose it were possible to have an “a” so that G,,-H., = 0, and let C), 
C®? be the circles corresponding to x; and x: in the description above. Then 
“q’’ is interior to 7 but is on CS. The closed segment joining 0 to “a” lies 
wholly in ag and (except for “a’’) wholly in IS. If we continue A(u) (as 
given by (1.14)) from 0 along this segment, we can pass over “a” (relative 
to x). But this same continuation obviously applies relative to x, , so that 
“@’’ is passed over relative to x7,. Hence “a’’ is not in G,, , and this contra- 
diction proves that (1.13) remains valid. 

We shall call the locus D(x) = ¢ a level curve for the sequence {P,(x2)}. Con- 
cerning these level curves (to be denoted by J.) we have the following properties. 

(i) D(x) is a continuous function of x (for all finite x). 

This follows from the fact that D(x) is a distance function, the set EF being 
bounded and closed. Now D(x) — « as |2|— «©; hence D(r) has a minimum 
value” D,,. We shall see presently that the value D,, is taken on at only one 
point. 


* This amounts to saying that the set of singularities of A(u) relative to a given 2 is 
independent of x in the sense that no two x's will vield contradictory information. Ac- 
cording to our definition of G, we ean regard G@G as the absolute set of singularities of the 
branch A(u) given by (1.14). This is so even though there may be singularities ‘‘a’’ of 
“A” which never present themselves for test (by our above method) as to their passed-over 
or singular character relative to an x 

If F consists of a single point, D, = 0. In all other cases, D,, > 0 
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From what has already been said, J. (ec 2 D,») is a bounded, closed set. 

(ii) Fore > D,, , J. consists of more than one point. 

If not, suppose J, is the point 2. Let 2; be a point on Jp,, and 22 on J, 
wherea >c. (Clearly, J, is not-null for every a 2 D,,.) Then an arc joining 
x, to x2, and not passing through x», must contain at least one point of J.. 
This provides a contradiction. 

(iii) Fore > D,, , Jc has no isolated points. 

If otherwise, let z» on J. be isolated. Then there is a neighborhood of 2 
(e.g., a sufficiently small circle K) such that for no point x ¥ 2 in K is D(x) = c. 
Therefore either D(x) < ¢ for all x ¥ x in K or D(x) > ¢c for all rx ¥ amin K. 
Suppose D(z) < c. Let 8), in EB, be such that d(zxo , Bo) = | to — Bo| = D(a). 
If the segment joining 8» to x» be extended beyond zy , then for x on this exten- 
sion (and in K) we have D(x) 2 d(z, B)) > D(xo) = c; a contradiction. 

Now assume it possible to have D(z) > c. By (ii) there is a second point 
z, on J-, and we assume x, to be chosen the closest of all points of J. — 2 
to tm. (J- — 2 is a closed set.) Let C, C’ be circles of radius c and centers 
Z», Zo. All points of EZ lie in or on C and in or on C’, and therefore in the 
closed zone Z common to C and C’. But for every z on the open segment 
joining z, to x4 , the circle C, , with center at x and passing through the inter- 
sections of C and C’, contains all of Z in or on it; and its radius is easily found 
to be less than c, so that D(z) < ec. Since z can be chosen to lie in K, we 
thereby arrive at a contradiction. 

(iv) If 2; , 22 are any points on J. (ec 2 D,,), then every point x of the (open) 
segment joining x, to x2 satisfies the relation D(x) < ce. 

The proof follows from the latter part of the proof of (iii). Since c = D,, 
is not excluded we arrive at the result: 

(v) The locus Jo,, consists of a single point (denoted by x*). 

This is important when we come to consider expansions in {P,(x) }-series, 
for it tells us that all regions of convergence of such series have the unique 
point z* in common. Another corollary of (iv) is 

(vi) The locus J, cannot contain three or more collinear points. 

Let ly be any ray (i.e., a half-line) issuing from z* at the angle 6, 0 S< 6 < 2m. 

(vil) The ray ly meets each locus J, (c > D,,) once and only once. 

First we observe that since D(z) — « as |x| —> «, every locus J, (ec > D,) 
is met at least once by .. Suppose ly meets J, in two or more points, and 
let x; , 22 be two such. By (iv), if zs lies on ly between x, and x2 , then D(z3) = 
ce < ¢. Then by continuity there is an z, between z* and 2, (supposing 2 
closer to z* than is zz) for which D(a) = ec’; so that 2, , lying between x, and 2; , 
must satisfy the relation D(z,) < c’ < ¢, a contradiction. 

Consider the locus Je, ¢ > D,. If we introduce a polar coérdinate system 
(r, 6), using z* as the pole (r = 0), we have shown that for each @ there is one 
and only one r. That is, if we write the locus J, as r = F,(0), then F.(0) is a 
single-valued function of 6,0 = 6 = 29. We can however say more: 

(vill) The function r = FA0) is continuous for all 6. 
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For suppose @ = 4 is a point of discontinuity, and let rm = F.(@). Since 
F.(@) is a bounded function, there exists a sequence {6@,} —> % such that 
{r, = F.(6,)} has a limit 75 ¥ ro. Now 2, = (ra, On) isa point of J.. Hence 
by continuity of D(x), lim (r,, 6.) = (ro, %) = Misa point of J.. Now Xo 
is on ray l,, and x) ¥ 2. We thus contradict (vii), so the assumed discon- 
tinuity cannot exist. 

Combining these results we can state the 

THEOREM 1.3. The locus Jp,, is a single point x*. For everye > D,,, J-isa 
simple closed convex curve, containing in its interior the point x* and the locus J. 
for every c’ < ¢. 

If the set of singularities of A(u) is finite in number,” the loci J. consist of 
circular ares, each J. being made up of a finite number of such ares, of radius c. 
There can be ares of circles even in more complicated cases, but if A(u) has, 
for example, curves of singularities or regions of singularities, then the loci 
may not have this simple character. 

From Theorem 1.2 we immediately have 


TueoreM 1.4. Let lim sup | c¢,/n!|"" = p < 1/D,, and set c = 1/p. The 
series 
(1.17) F(x) = Do en Px (2) 
0 


converges absolutely for every x interior to J. , and converges uniformly in every 
closed region interior to J.. The sum function F(x) is analytic within J... 

One naturally inquires if there can exist points of convergence beyond J. . 
This is sometimes the case, and was indeed encountered in the treatment of 
equation (1) of the Introduction.” 

Before going on to the solution of the linear differential equation (2) of the 
Introduction, we wish to examine the exponential type of the funetion e“ L(t). 


Let L(t) be of exp. type p (< 2), and let e“L(é) be of exp. type r = rir). e® 
is itself of exp. type | a2 |. We have 

LemMMA 1.2. For all x, 
(1.18) r(x) = D(x). 


For the radius of convergence of a series > b,t" is the reciprocal of the exp. 
type of its BEF, > b,t"/n!. Now e“L() = BEF{LS) a! P,(a2)t"}, so that from 
(1.13), r(x) = D(a). 

From this follows 

Lemma 1.3. For all x, 


(1.19) p ir sr= D(r)Spt+ir 


’ ° ° e so 
For, the radius of convergence of A(t) = SS nta,t’'! is 1p, so that the a 
point nearest the origin satisfies the relation | a 1 p; therefore the 3 point 


" This was the case in our Transactions paper (already cited) 
® Loc, cit., Transactions, vol. 39 (1936), p. 355, footnote 
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farthest from the origin is such that |8| = p. Since D(z) = max |z — 8), 
we obtain 7(z) = D(z) S |x| + p. Again, the point 8 nearest to z satisfies 
the condition |@ — x| = |p — |x| |, as a diagram readily shows. Hence 
72|B—2z| 2|p—|2||. 

Both limits in (1.19) can be attained. Consider, for example, L(t) = e”*. 
Then r(p) = 2p = p + | 2], and r(—p) = 0 = p — | 2}. 


Part II. The associated linear functional equation: a semi-local solution 
Let 


(2.1) L@®) = Dat" 
0 
again be of finite exp. type p, so that 
(2.2) lim sup | n!a, |"" =p < @. 


n= 


We consider the linear differential equation of infinite order generated by L: 
(2.3) L{y(2)] = DX any(2) = F(z). 
0 


It is no restriction to suppose’ that a7 ~ 0. Then K(t) = 1/L(é) exists as a 
convergent power series in ¢, and if we define {Q,(x)} as the Appell set for K(é), 
we have 


(2.4) x[=| = Q,(2), 
or, what is equivalent (as is readily established), 
(2.5) L{Q.(2)] = =. 


From the relation between Appell set and generating function (see (4) of Intro- 
duction) we also have 


(2.6) K(de" = > Q,(x)t", 


so that if C is a contour around t = 0, lying within the circle of convergence of 
K(t), then 


‘i re e* 
(2.7) Q,(x) = Oni [ Lorn 


The set {Q,} is less suitable for expansion purposes than is {P,} of Part I, 
since the generating function for {Q,} is only of class (ii). But by altering 


13 For suppose a; = 0, 7 = 0, 1, ---, p — 1, ap ¥ O, and let Lift) = L(d)/l = 
Ap + Gpyit +---. If we can solve L,[y(x)| = G(x), then on differentiating p times we 
get L[y(x)] = G@@ (zx), so that (2.3) is solved on choosing G to satisfy G™(z) = F(z). 
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the contour C we are led to a sequence of functions much more serviceable 
than {Q,}. The method used is due initially to Hurwitz’ (who was concerned 
with the simple difference equation Ay(r) = F(x), F(x) entire). 

Since K(t) = 1/L(t) is a meromorphic function the right-hand member of 
(2.7) will define a function no matter how distant the contour C is from the 
origin, provided that C does not pass through any singularity of K(t). Choose 


such a contour C = C,,, of radius r,, center at the origin, and denote the 
corresponding function by Q,.,,, : 
(2.8) ee ee 

gs 2ri Jc, L(t 


On applying operator L to both sides of (2.8), and taking L under the integral 
sign (an operation which is easily shown to be valid), we obtain 


(2.9) L1Qur,(z)] = 7. 
n. 


We now examine the modulus of (2.8). From (2.1) and (2.2) we see that for 
every « > 0 there is an N = N, such that 
(2.10) M(r) = max | L(t)| < Ne?**”. 

|tj=r 

Again, there is the following theorem on entire functions:"° 

Let f(z) be of finite order and let k be any number exceeding one. There is a 
number H = H(k) such that for every R > 0 the closed interval (R, kR) contains 
at least one number r for which 


| f(z) | > [M(kr) 
for all z on \z\| =r. 
Here again M(r) is the maximum modulus of f(z) on | z | = r. 
Set k = 1 + 6,6 > 0, and apply the theorem to L(t). It gives 
M(r) > [M(r(i + 8))T", 


or 


(2.11) : 


LO. < [M(r(1 + 6))]", 


where, r’ being any number, the number r exists somewhere in r’ < r S r’(1 + 4). 
. , ~ 
As we are going to vary our contours C, , let us replace r’, r by r,, 7. We 
. Ui . r 
take r, (which depends on r,) as the radius of C,. Then from (2.8): 


[Qnr()| SAIMOra(L + ANIM es rte LM + I"; 
and from (2.10): 


(2.12) | Quir,(x) | < Bg? ftw 


4 Acta Mathematica, vol. 20 (1896-97), pp. 285-312. 
18 G. Valiron, Lectures on the General Theory of Integral Functions, Toulouse, 1923, p. 89. 
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+ Soe . 
Now r, is arbitrary. Let us choose 
, 
(2.13) fr, = dn, 


d being independent of n, and make use of the asymptotic relation 


nin (en)'("). 
e 


Then 

(2.14) n! | Qu.r,(2) | <N. ad ent Mea [s|-ta Cote +O]} 
and 

(2.15) lim sup | n!Qz,,,(x) ve < ean, 


Consider the right-hand member as a function of d. It is found that there 
is a minimum for d = {(1 + 4)[|z| + H(p + &(1 + 4)]}. In order to have 
d independent of x, we set x = 0. Furthermore, since ¢ > 0 is arbitrary, it is 
suggested that we choose 


(2.16) d = {Hp(1 + 4)*}". 
The right-hand member of (2.15) becomes 
Ho(1 + 8)? elem tisitaoG+ei— ty 


Now the left side of (2.15) is independent of «. We therefore obtain” 


(2.17) lim sup | n!Q,,,,(x) |" < Hp(1 + 6)?-e!?!/"ear, 
Let {c,} be a sequence with lim sup | ¢, |" = ¢, so that 
(2.18) lim sup | n!enQn,r,(z) |" S$ oHp(1 + 6)?-e!7!/"rr, 


The right-hand side will be less than one for |2z| sufficiently small if 
oHp(1 + 6) < 1. We ean therefore state 

THEOREM 2.1. Let L(t) be of exponential type p(< ~). There is a number 
A > 0, depending only” on L(t), such that if 


(2.19) lim sup {e¢, |" =0< 2 
pA 
then series : 
(2.20) De 2! en Quira(2) 
n=0 


converges absolutely and uniformly in some neighborhood of x = 0. 


16 Since 6 > O is arbitrary it might be thought that we can let 6— 0. But H depends on 
6 in a manner that is not specified in the theorem (in which H and k = 1 + 6 first appear); 
we do not therefore know what value of 6 will make the right-hand member of (2.17) a 
minimum. 

17 \ can be taken as the minimum (or greatest lower bound) of H(1 + 6)? for all 5 > 0. 
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Coro.iary. There is a number w > 0, depending” only on L(t) and o, such 
that (2.20) converges absolutely for all |x| < w, and converges uniformly in every 
closed region in |x| < w. 

When the exponential type p is zero, there is a marked simplification in the 
above theorem. For consider (2.15) where now we have set p = 0. Since 
e > Ois arbitrary, we can let e—0. This eliminates the term He(1 + 6). Again, 
since 6 > 0 is arbitrary, we can let6— 0. The left-hand member is indepen- 
dent of « and 6, so we obtain 


(2.21) lim sup | n!Q,.-,(2) |"" s aot, 
(2.22) lim sup | n!e, Qu.-,(x) |" Ss =e 


Now choose’ d = o (where for the moment we assume that o # 0). Then 
(2.23) lim sup | n!enQ,,,,(2) |" s e7!7!7. 


From this follows 

THEOREM 2.2. Let L(t) be of exponential type zero. To” everyo < ~ there isa 
sequence of contours C, such that if lim sup | c, |" = o then series (2.20) con- 
verges absolutely for all x in |x| < 1/0 and converges uniformly in every closed 
region in |x| < I/e. 

Lemma 2.1. Let L(t) be of exponential type p(< ~). Then 


(2.24) Liy(z)] = Do any (2) 

0 
converges absolutely in |x — %| < X¥ — p if y(x) is analytic in the circle 
|x — a | <A, (A > p); and the convergence is uniform in every closed region in 
|z—t| <A—o. 


To show this, let C be a circle with center at 2 and radius \’, where p < \’ < X. 
| < \’ — p we have 


For z in | x — 2x | 
tiles n! y(t) dt 
U n= 
y” (x) / 


2m Je (t — x)ntt? 
‘S If 6’, H’ are the values giving rise to A (see previous footnote), then w is defined by 
w = —pH'(1 + 3’) log [epaAl. 


This is seen by finding for what values of x the right-hand member of (2.18) is less than 
unity. If we set 
A = —pH'(1 + 3’), B = A log (pA), 
then we can write 
w= B+ A loga, 

where A and B depend only on L(t), not one. 

'® This means that the contours C,, , which serve to define the functions Q,,,,, (7), change 
with o, so that for different o’s we may be dealing with different sequences {Q,.,,(x)}. 

2° The preceding relation d = « which furnishes the proof requires that ¢ + 0, but Theo- 
rem 2.2 is readily seen to hold even if ¢ = 0. 
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and therefore 


n=0 (t — x)" 


1 [ (< nia, 
Lig = — i TOP» bat, 


since the series converges uniformly (and absolutely) for t on C and z in any 
closed region in |x — x2 | < \’ — p. On letting )’ increase toward \ we obtain 
the desired conclusion. 

Corotuary. Jf p = 0, then Lly(x)| converges in the neighborhood of every 
point a at which y(x) is analytic, and the convergence reaches out to the circle of 
convergence of the power series of y(x) about x. 

Lemma 2.2. With the hypotheses of Lemma 2.1, if 


(2.25) y(x) = Do y.lx — 20)", 
0 
the radius of convergence being X, then 
(2.26) Ll{y(z)] = > ynL l(a — x)"], 


the latter series converging uniformly in every closed region in |x — 2% | < %— p. 
. . " j1/ 
Since lim sup | y, |" = 1/A, to every « > 0 corresponds an a = a, such that 


|Yn| < a(t +6) ; 
y(x) Ka =| (i+ :) |x — || oe a - 
1-( )i2- 2 


Therefore on setting u = (1/A + 6) | 2 — a, 


hence 


“ n! | Ay | ( a :) 
2.27) Lly|] <<a ‘ 
(2.27) [y] > Gaur 
This is valid provided that | uw | < 1 (a condition which is true if z lies in any 
closed region in |x — 2» | < A, when ¢€ is chosen sufficiently small), and that 
|1—u\>p(l/A+.). This latter condition is fulfilled if 


\ — p — pre 
= ed 
‘hie, ith 


’ 


and from the arbitrariness of « we need only have | x — 2» | <A — p. Absolute 
and uniform convergence of (2.27) in any closed region in x2 — %| <A — p 
is immediate. From the absolute convergence it follows that the double series 


C-) c) c) (n) 
Lly] = L|> y.(z — ns)" => p> y(z — na} 


n=0 


k=0 
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converges absolutely, and may therefore be summed in any way. But one 
x 

way of summing is to take L under the 2-sign: Zz yn L\(x — 2)"]. This es- 
0 


tablishes the lemma. 
Coro.tuiary. If p = 0, then (2.26) is valid in |x — x \ < X. 


Lema 2.3. Let {u,(x) = ps bu(a — xo)‘}, n = 0, 1,---, be analytic in 
k=0 


/£ — xX <A, A> p, and let the double series > | Du(a — xo)" | converge for 


n,k=0 


\2—2%| <A. Then 


(2.28) Ap» vale | = > L{u,(x)], 


n=0 n=0 


both series being uniformly convergent for x in any closed region in |x — X%\ < 
A — p. 

The proof is patterned after that of Lemma 2.2, and need not be given in 
detail. Essentially it consists in showing that the double series 


x 

De anfuy” + ut” + us” + +. | 

n=0 
is absolutely convergent in | 2 — 2», < ’ — p (uniformly in any closed region 
therein), and may therefore be summed in any way. One such summing 
yields the right-hand member of (2.28). 

Now the series (2.20), with x replaced by + — 2», fulfills the conditions of 

Lemma 2.3 when o is so small that w > p. For, (2.8) gives us the series 


Qn.:,(@ — 2) = e& ne = ( : I e 3 it) 
tte "2 6 2nri Je, Lit) ; 


convergent for all x. We must therefore show that series 


ea) r— Xo k nic t* n-1 
¢ : Fy fect lt 
(a) x, ki! Oni [ Lio ‘ 


converges for |x — 2 | < w. On using inequalities (2.10) and (2.11) in the 
° . Ul . . 

manner already considered, and choosing r,, = dn, series (a) is found to be less 

than 


x 

y n Cr nd (1+6)| H(p+e)(14+-6)+|2r—279]} 
(b) N’ ys -€ ; 3" 
no (ed)” 


We can let « — 0. On again choosing d = |Hp(1 + 5)°}~*, we see that (b) 
converges provided x is such that oHp(1 + §)*.¢e! teltMoatOI"® < 1 Since 
\ = H(i + 5)’, it now follows that (a) converges for all x in |x — x2 | < a, 
as was to be shown. 








608 I. M. SHEFFER 


This gives us the relation 


i] nle,Qnur, (x o n)| = > nic, L [Qn (ax aa xo)]; 


and on using (2.9) (with x replaced by x — 2x), we have 
THEOREM 2.3. Let L(t) be of exponential type p(< x). Let limsup |¢,|°" = ¢ 
be so small that" w > p. Then 


(2.29) L| nn, Qn, (x ~— | = > c,(x — Xo)", 


n=0 


the left side being convergent in | x — x | <w — pand the right side convergent in 
®— 2%o| < I/e. 

Otherwise stated, Theorem 2.3 gives us a semi-local existence theorem for 
the equation (2.3): 

TuHroreM 2.4. Let L(t) be of exponential type p(< ©). If F(x) ts analytic 


about x = x in a circle of radius r > 1/o*, then equation (2.3) possesses an 
analytic solution y(x) in the neighborhood of x = x9. Morcover, if 
Bd) 
(2.30) F(x) = Does(x — a0)", 
0 
then y(x) has the form 
x 
(2.31) y(x) = n'c,Q,..,(@ — 2), 
0 
this latter series converging in |x — 2%) < w — p, where w is determined for the 
value” « = 1/r. 
We refer to this as a semi-local solution because our method does not permit 
the circle of analyticity of F(z) about « = a) to become smaller than 1/o*. 


There is however no reason to believe that this restriction, which results from 
the method used, is inherent in the problem. The question remains open. 


If p = 0 this restriction automatically disappears, so that we have 
THEeoreM 2.5. Let L(t) be of exponential type zero. To every function F(x) 
analytic about x = x» there is a function y(x), also analytic about x = xy, which 


satisfies equation (2.3); and if (2.30) has the radius of convergence r, then (2.31) 
also converges” for |x — a | <r. 

These solutions, semi-local or local as the case may be, permit us to return 
to the problem of Appell expansions. Let F(x) be analytic about x = 0 ina 


21 We saw that w = B + A logo where A, B are independent of ¢ and A <0. Hence 
limw = +* aso— +0, so that values of o exist for which w > p. Let us denote by o* that 
value such that ¢ < o* implies w > p. Then 

o* =e 


2 In particular, ifo = 0 then w = &, so that equation (2.3) possesses an entire function 
solution when F(x) is entire. 
23 In particular, if F(z) is an entire function, so is y(z). 
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circle of radius r > 1/0*, so that Theorem 2.4 holds, and let y(x) = > y,2" 
be the solution. Its radius of convergence is at least \ = w — p. Suppose 
\ > p, so that w > 2p. Then by Lemma 2.2, 


L{y(2)] = > ynL{[2") = > n!ynP,(2), 


where {P,(x)} is the Appell set generated by L(t). But L{y(x)] = F(z). 
Accordingly we have 

THEOREM 2.6. Let L(t) be of exponential type p(< ~«) and {P,(x)} the corre- 
sponding Appell set. If F(x) is analytic about x = 0 in acirele of radius r = 1/¢, 


where o is so small that w > 2p, then F(x) possesses the Appell expansion 
(2.32) F(x) = Dn! y,P,(2), 
n=0 


valid in the neighborhood of x = 0. Here \y,} ts the set of coefficients of the solu- 
tion y(x) = >> ynx" of (2.3) given by Theorem 2.4. 

This theorem is not as satisfactory as one might wish. It refers F(x) to the 
origin, whereas we know from Part I that the central point in P,-expansions is 
the point 2*. The difficulty resides in the semi-local character of the solu- 
tion of (2.3). 

When p = 0 this difficulty disappears. For then A(é) is an entire function. 
Its only singularity isa = «©, so that 8 = 0. Hence 2* = 0, and the level 
curves are concentric circles with center at the origin. Application of Theorem 
2.5 and Lemma 2.2 gives 

THeoreM 2.7. Let L(t) be of exponential type zero. Then every function F(x) 
analytic about the origin possesses a P,-expansion that is valid in the same circle 
as is the power series for F(z). 

This shows that the Appell expansions corresponding to a function L(t) of 
exp. type zero are markedly like ordinary power series in their convergence 
properties, 
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UNIFORMITY PROPERTIES IN TOPOLOGICAL SPACE SATISFYING 
THE FIRST DENUMERABILITY POSTULATE 


By L. W. CoHEN 


Theorems involving convergence, completeness and uniform continuity are 
consequences of what may be called the uniformity properties which inhere in 
metric spaces. It is perhaps of some interest to seek in spaces which are not 
metrizable similarly effective uniformity properties, in terms of which the 
theorems referred to may be reformulated.’ It is the purpose of this note to do 
this. The spaces considered are the topological spaces of Hausdorff satisfying 
the first denumerability postulate, namely, that to each p C S the topology 
at p is determined by a sequence of neighborhoods U’,(p). The uniformizing 
entity is the class [U,] of all U,(p) for fixed n and all p C S. We make the 
following definitions: 

1. A sequence p, C S is a Cauchy sequence if for each n there is a k, and a 
qn © S such that p. C U,(q,) if k > k,. 

2. A space S is complete if every Cauchy sequence has a limit. 

3. Aset M C S is totally bounded if for each n there are points 


Pay Pn.2,°** » Pum, 
such that 


MC > U,(pn,:). 
i=l 
4. A function f on M C S to S’ is uniformly continuous on M if for each n 
there is an m(n) such that if p C M 


SIMU mem (p)] SC ULC (p)). 


It is clear that these definitions become the usual ones for metric space with 
spherical neighborhoods. 

The justification for these generalizations is to be sought in the theorems 
that a set M in a complete space is compact if and only if it is totally bounded, 
and that if F is uniformly continuous on M C S to a complete space S’, then 
there is a function F* on M to S’ identical with F on M and continuous on M. 


Received June 2, 1937; presented to the American Mathematical Society, November, 
1935, under the title Cauchy convergence in non-metric space (preliminary report), and 
April, 1937, under the title Uniform continuity in topological space. 

1 The relation between completeness and compactness is discussed by J. von Neumann, 
Annals of Math., vol. 36 (1935). Recent notes on the subject have been published by 
Garrett Birkhoff, Annals of Math., vol. 38 (1937), pp. 57-60, and by L. M. Graves, Annals 
of Math., vol. 38 (1937), pp. 61-64. 
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It will appear that for certain of these results a further restriction must be 
placed on the spaces, but it does not imply metrizability. 
THEOREM 1. If Sis complete and M C S is totally bounded, then M is compact. 
Proof. Let px be a sequence of points of M. Then 


(py) CMC > U1s(qu..) 


for some points qi1, 91.2, °--, Qi.m, in S. Denote by Ui(q:) one of the 
Ui(q,:) containing a subsequence p,” of py. Assume that U,(q,) contains a 
subsequence p;"’ of py. Then 


Mn+1 


(p;”’) Cc MU ,(qn) S p> U n4s(Qn4s.a) 


for some points Gn41,1, Qndt.25 °° + Qnttomgs,;- Denote by Unsi(@ns1) one of 
y ° . -1) ) T 
the Unsi(Gnsi,:) Which contains a subsequence p,"*” of px”. We now have a 
(n) 


sequence of neighborhoods U,,(q,) and a sequence of sequences py” such that 
Un(gn) D (pk) D (px"*™”). 
The diagonal sequence py = p;"’ satisfies the condition 
Pe Un(Qn), k>n. 


ps is a Cauchy sequence and, S being complete, M is compact. 

It is to be noted that the notion of Cauchy sequence is not topologically 
invariant. This is the case in metric space also. Under uniformly continuous 
mapping, however, we do have invariance. 

TuHEeoreM 2. If f is uniformly continuous on S to S’ and p, is a Cauchy se- 
quence in S, then f(p.) is a Cauchy sequence in S’. 

Proof. Consider n and m(n). Then pre C Umny(p) for some p C S and 
k>k,. Hence f(pr) C f[Umin(p)] C UL(f(p)) for k > ka. This being so for 
all n, f(p,) is a Cauchy sequence. 

We impose on S the 

PostuLaTe. To cach p C S and positive integer n there ts an integer m(n, p) > 
0 such that if q © Umcn.»)(p) then p € U,(q). 

TueoreM 3. If M C S is compact, then M is totally bounded. 

Proof. Assume that M is not totally bounded. Then for some n, M is not 
contained in the sum of a finite number of U,(p). For each k and 


Pi, D2, +++, Pr CM 
there is 


, 
micM—-M>U,(p), 
t=1 


M being compact, a subsequence p,, of p, has a limit p C S so that 


Pk, Cc U ;(p), t> t;. 
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Choosing 7 = m(n, p) and t > tnn,»), we have p C U,(px,) and 
pe CS Un(pe,) 


for infinitely many k. Let px, = p, and r be the smallest integer greater than s 
for which p, C U,(px,). We have 


r—1 8 
ppCM —M YX U,(p) C M — MD U,(p) C M — MU,(p,) 
i=l t=1 

and a contradiction. 

With S subject to the postulate given above and S’ regular and complete, 
we have 

THeoreM 4. If f is uniformly continuous on M C S to S’, then there is a 
function f* on M, the closure of M, such that f = f* on M and f* is continuous 
on M. 

Proof. To each p C M there is a sequence p, C M such that lim p, = p. 
Consider, for p and n, the number m(n, p) of the postulate. Since 


Pk Cc U mn,» (P) 


for kn > Kmin,») , Pp © Un(pe,) for each n and some k, . 
Since f is uniformly continuous on M and p, C M, we have 


S(px) CS [MU ncn) (Dim cny)] Cc URS (Pim eny))s k > K min) 5 


so that f(p,) is a Cauchy sequence in S’. S’ being complete, there is p* C S’ 
such that lim f(p,) = p*. If lim q = p and q C M, there is the sequence 
r, © M whose terms are alternately p, and g, which has p as limit. Hence 
lim f(r:x) = lim f(q) = lim f(px) = p*. Thus for each p C M and every se- 
quence p, C M with limit p, lim f(p,.) = p*. This defines a single-valued map- 
ping f*(p) = p* on M to S’. 

If p C M, then pp = p C M and f*(p) = lim f(p,) = f(p) so that f* = f 
for p © M. If f* is not continuous on M, there is a p C M and an n such 
that for every m, MU,,(p) D p» such that f*(p,.) C S’ — UL(f*(p)). We may 
choose p,, so that 


Dm Cc MI T stm) Cc M IT U,(p). 
= 


Since S’ is regular, there is a vy such that U/(f*(p)) C U,(f*(p)). Thus for 
all m 


f*(pn) CS’ — Un(f*(p)) CS’ — Ti(f*(p)) C 8’ — UL(f*(p)). 
Now pn C M so that there is a sequence qm,, C M such that lim; qm,x C pm- 
Hence gn,x C [] U,(p) if k > kn. Further limg f(qn,x) = f*(p») implies 
a= 


Sm.) CS’ — Oi(f*(p)) ifk > ki. 
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Hence for k(m) > max (km, ky) we have 
ami © IL UW), — $Qn.rom) © 8! — Ti(F*@)) C8" — ULF). 
=~ 


Now lim». Gm,k(m) = Ps Gm,kim) & M and the closure of S’ — U}(f*(p)) yield 
lim f(qm,xcm) = f*(p) CS’ — UL(f*(p)), 


which is a contradiction. Thus f* is continuous on M. 

We give an example of a non-metrizable topological space S satisfying the 
first denumerability axiom and our postulate. The points of S are the points 
p(x, y) of the Cartesian plane. Let S(p, r) be the interior of the circle with 
center p and radius r. The neighborhoods of po(0, 0) are defined by 


U.(po) = s(m, ‘) — {p}, 
n 
where p(z, y) has x > 0, y = 0. If p(z, y) has y ¥ 0, U,(p) = S(p,n'). If 
p(x, 0) has x > 0, then U,(p) = S[p, (k + n)~'], where k is the smallest integer 
such that the distance from p to po is greater than k’. The space fails to be 
metrizable because it is not regular at po . 

By adding the restriction of regularity to the space we can continue the 
theory to include Baire’s theorems on category and on the distribution of 
points of continuity of the limit function of a sequence of continuous func- 
tions. These theorems are of considerable importance in existence proofs in 
functional analysis. In the subsequent discussion it will be assumed that the 
space S is regular as well as complete in the sense defined earlier. 

THEorEM 5. If A,, is a sequence of sets in S such that A,.; is dense in A, and 
G,, is an open set containing A, , then [J G,, is dense in Ay . 


Proof. Consider p, C A; and any U,(p:). There is U,,(p1) C U,(p)G, . 
Then Ui(p:)U»,(g1)A2 D pe and there is a neighborhood U’,,(p2) such that 


(2) Un, (pe) c Ui(pi) Un, (r)Ge « 


If px © A, and U,,(px) are defined, the Ux(p.)Un, (pe) Aras D pers and there is 
a U,,,,(per) such that 


(k + 1) O ngs, (Desa) CS Un (pe) Un (pe )Gesr « 
From the relations (k), we have 
Pkim = UO nns0 (Prim) Cc Ux(pe), m = 0. 


Thus p, is a Cauchy sequence and, S being complete, lim p, = p exists. Also 
from (k) it follows that 


IV 
—) 


Pkim = CU ., (pe) sea (i, , m 
so that p CU,, CG; for allk. Thus p CJT] G.U,.(p:) and [J G; is dense in A. 
k is 








614 L. W. COHEN 


THEOREM 6. If A, is a sequence of Gs sets in S each dense in every other one 
then [] A, is dense in each A,,. 


n 
Proof. Each A, = [J] G,., where G,,,, is open. Let 
m 


Gy = Gia, Ge - G21, ~#is , O = oe vas 


be the enumeration of the G,,,, by diagonals. If we set B, = A, when G, = 


Gnn, then G, D B, and B,,; is dense in B,. Hence II G, = Il Gan = II A, 
k m,n n 


is dense in A; by Theorem 5 and consequently in every A, . 

We have a consequence in 

THEOREM 7. If M is a G; in S and A, in a sequence of sets open in M and 
dense in M, then II A, ts dense in M. 

Proof. Each A, , being open in M, is a G; set in S. Each A,,, being dense 
in M, is dense in every other A,. Hence by Theorem 6 II A,, is dense in A; 
and also in M. ws 

From this it follows that S is of the second category. In fact we have 

THEOREM 8. If M is a G; in S and A,, is a sequence of subsets of M each 
nowhere dense in M, then M — >> A, is dense in M. 


Proof. If F, = MA, and G, = M — F,, then G, is open in M and dense 
in M. By Theorem 7, [] G, = M — >> F, is dense in M, hence M — }> A, D 
M — > F, is dense in M. 

This leads to the theorem of Baire on the continuity properties of the limit 
function of a sequence of continuous functions. 

THEOREM 9. If A is aG; in S and f,(p) ts a sequence of functions continuous 
on A to a metric space R, having the limit function f(p), then the set of points of 
continuity of f(p) ts dense in A. 

Proof. For a given n > 0, let Ax be the subset of A on which 


dl f(p), fx(p)] <0 ifn > k. 
Then A = >> A; since f is the limit of f, on A. By Theorem 8, at least one 
k 


of the A, is not nowhere dense in A. Hence for some k and some non-empty 
A* open in A, Ay D A*. 
Let q be any point of A*. Then for some m > k 


d[f(q), fm(Q)] < 1, 
since f(q) = lim f,(q). Now every U,(q) contains a point p C A, and for such p 


d[f(p), fr(p)] < 1, ifn > k. 


For the given m and each n > k, there is a U,(q) such that 


A[fn(P), fm(Q)] < 2, Al fn(p), fn(Q)] < 0, ifp C U@)A, 














UNIFORMITY PROPERTIES IN TOPOLOGICAL SPACE 615 


because f, and f,, are continuous at g. Further, since m > k, 


d(f(p), fm(p)] < 2, if p C Ax. 


Hence for the given m and each n > k and an existing p C U,(q)U,(q)Ax, 
we have 


AL F(9), $n(Q)] S ALF), Fm(Q)] + A fm(), fm(p)] + dl fm(p), f(p)] 
+ d[f(p), fx(p)] + al fn(p), fn(Q)] < 5m. 


Now consider any « > 0, » < fe and the corresponding A*. Since A* is 
open in A, there is, for each g C A*, a neighborhood U’,(q) such that 


d(f(p), fu(p)] < «, ifn >k, p CUA. 


Let A(e) be the set of all g in A satisfying this condition. Then A(e) is open 
in A and not empty. The function f(p) is continuous at each point of [] A(n™). 


These statements remain true if the set A is replaced by any set B open in A. 
=e © ° ~~ =§, 
Hence A(n‘') is dense in A for each n and, by Theorem 7, [] A(n™’) is dense 
n 


in A. Thus the set of points of A at which f(p) is continuous is dense in A.” 

As an example of a regular Hausdorff space which satisfies the first denu- 
merability axiom and is complete but not metrizable we have the space 7’ whose 
points are the ordinals of the first and second classes and whose neighborhoods 
are open segments of the ordered set 7’. 


UNIVERSITY OF KENTUCKY. 


2 It is to be noted that the proofs of Theorems 5-9 are parallel to those for metric spaces. 
Cf. Hausdorff, Grundziige der Mengenlehre. 








SOLUTIONS OF SYSTEMS OF DIFFERENTIAL EQUATIONS IN TERMS 
OF INFINITE SERIES OF DEFINITE INTEGRALS 


By JrEssE PIERCE 


Introduction. The general solution of a differential equation of the first order 
and first degree can be found in terms of an infinite series of definite integrals.’ 
The definite integrals appearing in the solution are solutions of linear differen- 
tial equations. 

The same method is applicable to finding the general solution of a system of 
differential equations. The functions giving the solution, however, are ex- 
pressed in terms of infinite series of solutions of systems of linear differential 
equations. The general solution of each system of linear differential equations 
can be found in terms of infinite series of definite integrals.” 

Hence the solution of the original system of differential equations is ex- 
pressed in terms of infinite series of infinite series of definite integrals except in 
the case where the original system is linear or the system comprises just one 
differential equation of the first order. 

In the present paper the general solution of a system of differential equations 
will be found in terms of infinite series of definite integrals in which every 
integrand consists of a finite number of terms. 

The system of differential equations to be considered has the form 


dy; , 
a: a a E. Sxs cal 
(1) dt gilt, Yi, ’ Yn) (i 1, » Nn), 





where the g(t, y:, --- , yn) are analytic in the y; (j = 1, --- ,m) and have as 
coefficients a,,...,, functions of tf which are integrable (Riemann) and satisfy 
the inequalities 


(2) | Gins---m, | S f(u), 
f(u) being a positive integrable function of u, the arc length of a rectifiable curve 
drawn from the origin to the point t. The exponent of y; (j = 1, ---,m) in 


the expanded form of the ¢; is represented by u;. The independent variable 
t is assumed to be of the form 


(3) t = g(u) + V—-1 Hu), 
where ¢(u), ¥(u) are real functions with continuous first derivatives. 


Received June 9, 1937. 

1 Solutions of a differential equation of the first order and first degree in terms of infinite 
series of definite integrals, presented by the author to the Ohio Section of the Mathematical 
Association of America, April, 1937. 

2 Solutions of systems of linear differential equations in the vicinity of singular points, 
American Mathematical Monthly, vol. 43 (1936), pp. 530-539. 
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Each unknown y; is obtained as the sum of a series yx (hk = 1, 2,--- ), 
whose terms are found by integrating, sequentially, certain polynomials in the 
yx (9 = 1,---,n;k = 1,---,h — 1). 

For convenience, the sequence 4; , --- , Zn Will be represented by » and hence 
the coefficient aj,,..., will be represented by ai, . 

In §1 the formal solution of the system of differential equations (1) is found 
and the convergence of this solution is proved in §2. In §3 a more general 
system of differential equations is reduced to the form (1) by a simple trans- 
formation. 


1. Formal solution of the system of differential equations (1). Equations 
(1) can be written in the expanded form 


dy; : a 
QB A= MO +L auld-vs + Ze awOvive ye =A m) 


where v = mw, + --- + mwa. 
When the y;, in equations (4), are replaced by 


(5) Yi = Dyk’, 


there results 


2 . d m ~ = “i = rh ) 
YK = a) +X alt) D Kya + DK Fall, vs) 
6) elie n> ae ins 


(@ =1,---,n;k =1,---,hk—1), 


where the ‘F(t, yj) are polynomials in the yj, of degree h and each coeffi- 
cient is one of the a;,(t). The parameter K is introduced in order that a con- 
venient arrangement of the terms in the right-hand members of equations (6) 
can be made. 

A formal solution of equations (6) can be found by replacing K by unity and 
then solving the following system of differential equations: 





(dyin a J 
dt ae a,(t), 
dyiz . 
(7) 4 = = poe ai(tyn, 
t j=1 
dyin . 5 
= a(t) Yj ra + Sinalt, yx) (h = 2,3, ---). 


Equations (7) are obtained by equating the coefficient of K” in the left-hand 
member of (6) to the coefficient of K*™ in the right-hand member. 








618 JESSE PIERCE 


Equations (7) have the formal solution 


ya = [alae + 6 = mld, 


~ 
te 
Il 


(8) | 2 a(t) auld) dt = nald), 


| Ya = [ b 6:(t) nina) + Firalt, nal) | dt = ni(t) 


(k =1,---,h—2;h = 2,3, ---), 


where the ¢; are arbitrary parameters. 
The set of functions defined by the infinite series 


xz 

(9) = py nin(t) (i _ 5, site n), 
h=1 

is a formal solution of the system of differential equations (4). 


2. Proof of the convergence of the series (9). A particular case of the system 
of differential equations (1) is the system 


dy;  —s f(u) 
(10) du a — > Y; (i _ l, i n), 
j=l 


whose right-hand members dominate those of (1). The system (10) hasa 
solution in which all of the Y; are equal, that is, the Y; all satisfy the equation 


dY fu) 


du 1 —nY" 


(11) 


The general solution of the differential equation (11) is 


= 1 — (1 — 2n[G(u) + c})! 


(12) Y 

n 
where 
(13) G(u) = [ f(u) du, 


and ¢ is an arbitrary parameter. We shall consider ¢ real and non-negative. 
The minus sign is used before the radical in order that all of the terms in the 
right-hand member of (12), when expanded in a power series in [G(u) + cl], 
be positive. This expansion has the form 


n{iGlu) + cP. [GQ + ec} 
4 + - or 


(14) Y =[G(u) +e] + = bn" "[G(u) + cl)’, 
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where the b, are real positive constants. The series (14) will converge when 
(15) 2n[G(u) + ce] < 1. 


The power series (14) can be found directly from the differential equation 
(11) by expanding the right-hand member of (11) in a power series in Y and 
then making the substitution 


(16) Y=) Y.K". 
h=1 
The result can be arranged in the form 


bs sed K* = f(u) + flu) > nY,K" 
h=1 h=1 


(17) + Sw) 2X K" (nh {¥iYaa + ++ + YeaYi} +--+ 4+ n' Y1] 


= f(u) + f(u) ps K'nY, + > K'F(u, Y,) (k =1,---,h — 1). 


It follows from the inequality (2) that the polynomial F,(u, Y;) dominates the 
polynomial ‘Fi,(t, yx). 
The system of differential equations corresponding to (7) is 


(dY 

endl 

| du flu), 

—dY2 _ . 

= = f(u)n¥i, 
(18) } ay 
3 es # 2,72 
me f(u) [n¥2 + n° Yil, 
| = = f(u)[nYiu + Frau, Y,)) (k =1,---,h —2;h =3,4,---). 


Equations (18) can be solved in terms of indefinite integrals in the form 


Y,= | s00 du = G(u) + ¢ = H,(u), 


Y.= | seonti du = 4n{G(u) + c} = He(u), 


| 

| 

| 
(19) ¢ 
| Y; = [ so) nttse + n°Hi(u)} du = 3n[G(u) + cl’ = H,(u), 
| 


Ys = / [f(u)nHya(u) + Fia(u, Hi(u))] du = an[G(u) + c]" = H,(u), 


where the a, are positive constants. 
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The series 
(20) Y = > A,u) 
h=1 
is a power series in [G(u) + c] and hence is precisely the same as the series (14). 
When equations (18) are solved as in §1 in terms of definite integrals, one 


obtains 


Y,= [500 du+ec= Glu) +c = Z,(u), 


} u 
|Y¥;= I f(u)nZ,(u) du = 4n[G(u) + ce} — 4nc® = Z,(u), 
0 


(21) 
| Y= [ seotnzatey + n°Zi(u)|du = 3n[G(u) + ec} — gn’e'G(u) — gnic® 
= Z;(u), 
By comparing equations (21) with (19) it is clear that 
(22) Z(u) = H,(u), Zi(u) = Hi(u) (h = 2,3, ---) 
It follows from equations (21) and (8) that 
(23) | nin(t) | S Zs (u) (h = 1,2,---), 
provided 
(24) le;| Se. 


Hence the series (9) will converge when the inequalities (15) and (24) are satisfied. 

Since the series (9) are absolutely convergent and the independent variable ¢ 
satisfies the relation (3), the functions y; have derivatives of the first order and 
satisfy the system of differential equations (4) for all values of t on the path of 
integration except at the set of points of measure zero at which the coefficients 
a, are discontinuous. 


3. Systems of differential equations with more general coefficients than those 
of (4). Consider the system of differential equations 


] ’ n id . 

(25) = = 6,(t) + a(t)a; + > 6;(t)a; + DX falta a --- 2 (¢ =1,---,n), 
i= = 

where uw represents the sequence wu, ---,4, andy = pw +--- + wr. The 


function 6(¢) is assumed to have the indefinite integral 


(26) | A(t) dt = Bit), 

















“ 
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which may become infinite at a set of points of measure zero on the path of 
integration defined in §1. The functions @;(t) have the form 


(27) a(t) = ¢(HE, 
where the ¢,(t) are integrable (Riemann) and satisfy the inequalities 
(28) ft) | Sf), 


for all values of t on the path of integration. The functions 6;;(t) are integrable 
and satisfy the inequalities 


(29) 9:0 | S flu), 


for all values of ¢ on the path of integration. The functions fi,(t) have the 
form 


(30) fil) = —_— (v = 2, 3, ae -), 
where the ¢;,(¢) are integrable, and satisfy the inequalities 
(31) | ful) | S flu), 


for all values of ¢t on the path of integration. 
The transformation 


(32) i = PO y;, 


reduces the system of differential equations (25) to the form (4) and hence the 
system (25) has the solution 


(33) i= ” > nin(t), 


where the y(t) are defined by equations (8). 
When the real part of the function 6(¢) approaches minus infinity as ¢t ap- 
proaches t’, a point on the path of integration, then 


(34) lim z(t) = 0. 


t—t’ 


When the real part of the function 8(¢) approaches plus infinity as ¢ approaches 
t’”, then 


(35) lim z(t) = 2, 


tt’ 


provided 


(36) lim } 2 nia(t) ¥ 0. 


tt’ h= 
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Conclusion. The coefficients of the differential equations (4) and (25) can 
satisfy the assumptions of §§1 and 3 without being continuous or bounded with 
respect to t. 

The solutions (9) and (33) are the general solutions of the systems (4) and 
(25), respectively, because they contain n parameters c;, which are arbitrary 
except for the inequalities (15) and (24). 

In the case where the limit of the real part of the function @(¢) is minus 
infinity when ¢ approaches zero, the initial values of the dependent variables 
x; are zero for every set of values of the ¢; . 


HEIDELBERG COLLEGE. 

















NON-n-ALTERNATING TRANSFORMATIONS 


By D. W. Hai anp G. E. ScHWEIGERT 


Let A and B be compact metric spaces and 7(A) = B a single-valued con- 
tinuous transformation. We shall say that T is non-n-alternating provided 
that, for any point x of B for which there exists a cutting K of A — T™'(x) 
consisting of at most n points, there is no point y of B such that 7~'(y) inter- 
sects both sets of the separation A — (T7'(x) + K) = A: + Ae. If K is the 
null set, this is the definition of a non-alternating transformation.' Conse- 
quently, this type of transformation is non-alternating; in fact, we have the 
following characterization: 

THEOREM I. A necessary and sufficient condition that a single-valued con- 
tinuous transformation T(A) = B be non-n-alternating is that T be non-alter- 
nating on the complement of every subset of A consisting of at most n points. 

Proof. Let x and y be points of B and K any subset of A consisting of at 
most n points. If T\(x)-(A — K) separates” T'"2.t4 ~ Hid - &, 
ie., if (A — K) — T(2)-(A — K) = Ai + As, T (y)-(A — K)-4; # 0 
(i = 1, 2), then this separation may be written in the form (A — 7”'(x)) — K = 
A, + Ae. Hence K separates T~'(y) in A — T~‘(x), contrary to the definition 
of non-n-alternating. Thus the condition is necessary. 

To establish the sufficiency, we notice that if there exist two points x, y in B 
and a cutting K of A — 7” '(x) consisting of at most n points such that 7” '(y) 
intersects both the sets A; and A» of the separation A — (T 2) + K) = 
A, + As, then (A — K) — T'(x)-(A — K) = A; + Ag and therefore 
T ‘(y)-(A — K) is separated by T'(x)-(A — K) in A — K. Consequently, 
T is not non-alternating on A — K. This proves the sufficiency. 

Lemma. Jf 7T(A) = B is non-n-alternating, B is non-degenerate, y « B, and 
two points of T”'(y) are separated in A by a cutting K consisting of k < n + 1 
points, thenk = n+ 1 and T(K) = y. 

Proof. If k S n, then T is non-alternating on the complement of K, by 
Theorem I. But this is impossible since 7 '(y) intersects two components of 
this complementary set. Thusk = n+ 1. If 7(K) # y, there exists a point 
pin K such that 7(p) ¥ y. Then the set of n points (K — p) separates 7 '(y) 
in A — 7 '(T(p)), contrary to the fact that 7 is non-n-alternating. There- 
fore, T(K) = y. 

One consequence of this lemma, namely, the fact that a point of order not 


Received June 12, 1937. 
1See G. T. Whyburn, Non-alternating transformations, American Journal of Mathe- 
maties, vol. 56 (1934), pp. 294-302. 
2 If L and M are subsets of N, we say that L “‘separates’’ M in N provided M is con- 
tained in N — Land N — L = N,+ Ne, where NiN, =0 = NiNzand MN, #0 # MN,z. 
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greater than n in A is equal to the inverse of its image provided B is non- 
degenerate, suggests the following theorem. This theorem covers, as a special 
case, the class of regular curves of order n + 1. 

TueoreM II. If A is a compact metric space every pair of points of which is 
separated by a set containing at most (n + 1) points and T(A) = B ts non-n- 
alternating (n > 1), then T is a homeomorphism on A provided B is non-de- 
generate. 

Proof. We first show that the inverse of every point b of B is an A-set.° 
Let G = 7° '(b); then G is closed and lies in a single component H of A. From 
the hypotheses of the theorem it follows that H is a locally connected continuum. 
Consequently, if p and q are any two points of G, there is a simple are pq joining 
these two points in H. If we assume that there is a point z in pq which is 
not in G, it follows that there is a last point y of G in the are from p to z and 
a first point z of G in the arc from x to g. Thus the open are yzz contains no 
points of G. By hypothesis, there exists a cutting K consisting of at most 
(n + 1) points and separating y and z in A, so that some point of K must lie 
on the open are yrz. Hence T(K) is not 6, contrary to the lemma. It follows 
that G contains every simple are joining two of its points, and hence, being 
closed, it is an A-set. 

It is also a consequence of the lemma that no set of n points separates G, 
so that this set lies in a true cyclic element C of A; therefore G = C, since C 
is a minimal A-set.* The set C now has the property that any pair of its points 
‘an be irreducibly separated by (n + 1) points. It follows’ that C cannot 
exist except as a single point. Thus 7 is 1-1 and hence a homeomorphism. 
This completes the proof of the theorem. 

Under the same hypotheses for n = 1 the above proof holds except for the 
non-existence of the true cyclic element C. If C exists, it must be a simple 
closed curve, and the image space B is not only a boundary curve,’ but it is 
homeomorphic with the original curve, provided that no true cyclic element of A 
has a degenerate image. 

If n = 0, the transformation is non-alternating and acts on a space having 
dendrites as components; this situation has already been discussed in the 
original paper on non-alternating transformations.’ 


3 That is, 7~'(b) is closed and contains every simple are joining two of its points in A. 
See Kuratowski and Whyburn, Fundamenta Mathematicae, vol. 16, p. 309. 

4 See Kuratowski and Whyburn, loc. cit. 

5 This follows from the theorem that for n > 2 there exists no continuous curve M such 
that for every pair of points P and Q of M there are exactly n independent ares from P 
to Q, but there does not exist for any pair of points (n + 1) such ares. This theorem has 
been proved for the cases n = 3 and n = 4 by Kusner and published in the Comptes Rendus 
des Séances de la Société des Sciences de Varsovie (1932). It has been established for 
the general case, but not yet published, by J. R. Kline. 

® That is, a compact locally connected continuum each true cyclic element of which is a 
simple closed curve. This term has been suggested by G. T. Whyburn, loc. cit., p. 301. 
It follows that a boundary curve is a compact continuum every pair of points of which is 
separated by at most two points. Here, this characterization is needed rather than the 
definition. 
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From this same source we learn that the property of being a boundary curve 
is invariant under non-alternating transformations. Since this result repre- 
sents in a certain sense a sharper form of the present theorem, it might be sus- 
pected that for a compact metric space A the property of being separated 
between each pair of its points by at most (nm + 2) points is invariant under 
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non-n-alternating transformations. That such is not the case can be shown 
by simple examples. However, the failure of the purest form of the analogy by 
no means denies the possibility of other closely connected results. For example: 
Is the non-n-alternating image of a regular curve of order (n + 2) hereditarily 
locally connected? 

The question as to whether or not the property of being a regular curve is 
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invariant under non-alternating transformations was raised by G. T. Whyburn 
and offered difficulties which led to this and to at least one other specialized 
type of non-alternating transformation. In answer to this question we may 
now make the following statement: There exists a regular curve A of order three 
and a non-alternating transformation’ T(A) = B such that the inverse of any 
point of B is at most three points, but B is not hereditarily locally connected, hence 
certainly not a regular curve. 

Example. Let Py denote a square on (0, 1) and for each positive integer k 
let P;. be the square on (0, 1) which lies above and makes an angle & = x 
with P,). Consider, for future reference, the intervals with end points 
3° +37" and3 "*r + 2-3" (m = 0,1,--- 37 = 0,1,---, 3" — 1), 
and denote each such interval by J,,,,. In P;, for k = 3” + r, make the 
following construction: (i) on the base J,,,, erect a square and denote the 
side opposite the base by s; (ii) using s as the middle third of the hypotenuse 
construct an isosceles right triangle which is disjoint with the interior of this 
new square. 

The space A is the sum of the boundary of P» and all the figures constructed 
in (i), (ii) above. The transformation T(A) = B will consist of a simple 
identification of all the squares P; with the particular square P» , i.e., let 6, = 0 
for all k. The image space B may be constructed as indicated by the figure. 
We have at once that 7 is one-to-one on A except at the ends of the hypotenuse 
of each triangle constructed in (ii) and at certain obvious points on the lines 
perpendicular to the basic unit interval. If p is any point of A such that 
T ‘(T(p)) # p, then p is not a point of the basic unit interval. Consequently, 
p lies in a non-degenerate cyclic element of A-P,, where pe P,. No other 
point of 7 '(7(p)) is in P,, hence T”'(T(P)) does not separate A-P;,. It 
follows easily that no inverse set separates A, i.e., T’ is non-separating, hence 
surely non-alternating. It will also be observed that the bases of all the tri- 
angles constructed on squares of the same height are joined in B to form an 
interval of unit length, and that the sequence of intervals thus formed has the 
unit interval as a continuum of convergence. Thus B is not hereditarily locally 
connected. The remaining properties of 7, A, and B are easy consequences of 
their respective definitions. 


UNIVERSITY OF VIRGINIA. 


7 In fact, this transformation is non-separating in the sense of Wardwell, i.e., for no 
b « B does T~'(b) separate A. See James F. Wardwell, Non-separating transformations, 
this Journal, vol. 2 (1936), pp. 745-750. 
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RESIDUATION IN STRUCTURES OVER WHICH A MULTIPLICATION 
IS DEFINED 


By Morcan WarpD 


I. Introduction 


1. Consider a set of elements A, B, C, --- forming an abstract structure’ = 
over which there is defined a commutative and associative multiplication XY. 
The multiplication operation is connected with the structure by assuming that 


it is distributive with respect to union: 
(3.3) A(B, C) = (AB, AC) for A, B, C in &. 


This condition is satisfied in the important instance of the ideals of a com- 
mutative ring. 

If we assume that any subclass of elements of > has a union and correspond- 
ingly strengthen assumption (3.3), the existence of a residual (German, ideal- 
quotient”) A:B follows for each pair of elements A, B of = with the defining 
properties 


A D(A:B)B; if A > XB, then A:B DX. 


It is easy to show that the residual thus defined has the formal properties of 
the residual in polynomial ideal theory (Macaulay, [3]). But in the special 
instance of ordinary arithmetic (when > is interpreted as the ring of rational 
integers) the residual has a number of additional interesting properties which 
do not hold in general; for example, 


(A:B, B:A) = I; (A, B):M = (A:M, B:M), 
M:[A, B] = (M:A, M:B), M:AB = |(M:A)(M:B)}:M, 
A:(B:A) = A(A, B):B, [A, B]:(A, B) = (A: B)(B:A). 


The problem arises then of determining the conditions under which these 
and other properties of residuation in ordinary arithmetic will hold in the 
abstract structure. It is not difficult to show that it suffices to assume* 

PostutaTE E. If A divides B, there exists a unique element Q such that 
AQ = B. 


Received June 22, 1937. 

1 Other terms are ‘“‘dual group’’, ‘‘Verband’’, “‘lattice’’. For a definition, see §2 of this 
paper, or O. Ore, reference [1] at the close of the paper. 

? The concept appears to be due to Dedekind [4]. See van der Waerden [2] or Ma- 
caulay [3]. 

3] here is the unit element with respect to multiplication. See §§2 and 3. 

‘ Postulate E is satisfied in every principal ideal ring. 
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But this solution of the problem is trivial. First of all, Postulate E is far 
stronger than necessary. Secondly, even if we weaken E by no longer requiring 
Q to be unique, the assumption is still undesirable because it renders the con- 
cept of the residual superfluous; A:B is merely a particular one of the quo- 


tients” We assume accordingly the weaker 


A 
(A, B) 

Postu.taTEe C. If A divides B, there exists an element P such that A = B:P. 

We shall show here that Postulate C alone suffices to prove that (A: B, B: A) = 
I, (A, B):M = (A:M, B:M) and M:[A, B] = (M:A, M:B). Postulate C isa 
sufficient, but not necessary, condition that = be an arithmetic structure, and 
a necessary condition that = be a Boolean algebra, if multiplication is identified 
with cross-cut. 

If we interpret our multiplication as the cross-cut operation of the structure 
(so that the structure is arithmetic by assumption 3.3), we obtain a residuation 
operation within the structure which does not seem to have been investigated 
even in common arithmetic. 

It is also possible to define residuation abstractly over the structure by a 
proper selection of the properties of residuation given in §4 without any reference 
to multiplication. This investigation has been carried out by R. P. Dilworth 
in an unpublished paper. 

A complete postulational analysis of the interrelationships between the struc- 
ture properties, and the operations of residuation and multiplication would 
appear desirable, but will not be given here. We shall content ourselves with an 
informal treatment, introducing our postulates on the basis of naturalness and 
convenience. Their consistency will be evident. 


2. We begin by recalling briefly the defining properties of a structure (Ore, 
[1]). We postulate the existence of a well defined division relation > which 
is transitive and reflexive. The equality relation = is then defined in terms 
of D by A = B if and only if A D> Band B DA. For any two elements A 
and B of = we postulate the existence of elements D and M such that 


DDA,DDB; if X DA,XDB, then X DD. 


ADM,BO2M; if ADY,BDY, then MODY. 


D and M are called the union and cross-cut of A and B. They are determined 
r . 6 
up to equal elements. We write’ D = (A, B) and M = [A, B]. 
If an element E divides every element T of a fixed subclass 6 of =, we write 


5 We define a quotient Q = ; as an element Q such that A = QB. See $4. 


6’ Ore uses [A, B] for union and (A, B) for cross-cut. We prefer to retain as far as 
possible the notation and terminology of ideal theory. 
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E >0. We state explicitly our assumption of closure of = with respect to 
union (Ore [1], p. 409): 

PostuLaTEe A. For every subclass © of elements of =, there exists an element U 
called the union of © such that 


U0; f X50, then XOU. 


We write U = u(O). 

In particular the element u() divides every element of =. We shall call it 
the identity element of = and denote it by J. 

If we assume the existence of a null element divisible by every other element, 
then the closure of = with respect to union obviously entails the closure of = 
with respect to cross-cut. Conversely, closure with respect to cross-cut and 
the existence of an identity imply closure with respect to union. 

If for every three elements A, B, C of = 


(2.1) (A, [B, C]) = [(A, B), (A, C)], 


the structure is said to be arithmetic, or distributive. (2.1) is equivalent to 
[A, (B, C)] = ({A, B], [A, C)). 

We recall that A > B if and only if A = (A, B) and B = [A, B], and that 
union and cross-cut are associative, commutative, and idempotent operations. 


3. We next assume that > is closed with respect to an associative and com- 
mutative operation X-Y or XY: 
(3.1) A-Bisin Dif A, Bare in 2; A-(B-C) = (A-B)-C;A-B = B.-A. 
We call this operation multiplication. We make the further assumptions 
(3.2) I.A = A for every element A of >. 
Here / is the identity element of the structure. 
(3.3) A-(B, C) = (A-B, A-C) for any three elements A, B, C of the structure. 
The following rules are easy consequences of these assumptions: 
B >C implies AB D> AC. A DB andC DD imply AC D BD. 
A = BC implies B > A. [A, B] D AB D[A, BI(A, B). 
(A, B) = I implies AB = [A, B] and (A, BC) = (A, C), any C. 


On account of Postulate A, assumption (3.3) must be strengthened, as from 
(3.3) we can merely deduce the distributivity of multiplication with respect to 
union for a finite number of elements. If 6 and # are subclasses of 2, we define 
their product 0 as the subclass of all products of elements of 6 and elements 
of @ If © consists of a single element 7, we write T& for Ob. We assume’ 


7 It suffices to assume merely that u(7#) = Tu(#) for the developments which follow; 
but this apparently weaker assumption is easily shown to be equivalent to Postulate B. 
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PostuLaTE B. The union of the product of any two subclasses of = is the product 
of their unions. 


II. Residuation 
4. The element R = A:B is called the residual of B with respect to A if 
(4.1) A D> RB; A D> XB implies R > X. 


The residual always exists for any A, B of =. For since A > AB, the class 
6 of all elements X such that A D> XB is non-empty. Let R = u(@). Then 
by Postulate B, since A D OB, A = u(A) D u(OB) = u(O)u(B) = RB, or 
ADRB. Andif A > XB, X lies in 9, so that R DX. 

The following properties of residuation which are needed later follow exactiy 
as in the ideal theory of commutative rings (van der Waerden [2], Chapter XII; 
Macaulay [3], Chapter 3): 


(4.2) A:A =I, A:I = A, ID A:BOA. 
(4.21) A:B = A:(A, B) = [A, B):B. 
(4.3) A:B = TJ, if and only if A > B. 
(4.31) If A:B = A, then A D XB implies A > X. 
(4.4) (A:B):C = (A:C):B = A:BC. 
(4.41) M:A = Bimplies AB:A = B. 
(4.5) M:(M:N) DN. 
(4.51) A D> Bimplies M:B D> M:A and A:M D B:M. 


TuHeoreM 4.1 (Macaulay [3], p. 32). If M and N are any two elements of 
Land A = M:N, B = M:(M:N), then B = M:A, A = M:B. 
A and B are then said to be mutually residual with respect to M. 


The quotient Q = A of two elements A and B of & is defined by 


B 
(4.6) A=QB; if A=XB, then QDYX. 
Unlike the residual, the quotient need not exist even if B > A. But if the 
class 6 of all X such that A = XB is non-empty, exists and is the union 
u(O). Clearly 4 = 1,5 = A, a exists. 


THEOREM 4.2. If the quotient ; exists, it equals the residual A:B. 


Proof. Let . = V, A:B = W. Then by (4.1), (4.6) A = BV — 


8 We use when convenient — and ~ for formal implication and equivalence to shorten 
the proofs. 
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ADBV—-W2>V-—BW D>BV—-BWD>A-—BW=A-VOW. 
W>VandV DW-—W = V. 
It follows from formula (4.21) that A:B = — 4 = [A, B] whenever the 
, rere (A, B) B 
indicated quotients exist. Consequently, if we assume 
PostutaTe D. If A divides B, there exists at least one element Q such that 


A = QB, 


ve 


(A, B)’ 


then the residual A:B reduces to the quotient We shall not assume 


Postulate D in this paper. 

We conclude this division of the paper with a lemma which we shall need 
subsequently. 

Lemma 4.1. Jf (R, S) = I, then (P,Q) D [P:R, Q:S]. 

Proof. Let P:R = U,Q:S = V. By (4.1), P D> UR, Q > VS so that 
(P, Q) > (UR, VS). It suffices then to show that (UR, VS) > [U, V]. 
(R, S) = I—-[U, V] =[U, V](R, 8S) = (RLU, VJ, S[U, V]). Now U >[U, V]—> 
UR D> R[U, V]; V > [U, V] ~ VS D S[U, V]. Hence (UR, VS) D> 
(R[U, V], SU, V}) D[U, V]. 


III. Distributive properties of residuation 


5. The following two “distributive laws” for residuation (Macaulay [3], p. 33) 
are due in essence to Dedekind [4]: 


I M:(Ai, As, ---,An) = [M:A1, M: Ae, --- , M:A,l, 
II [A,, Ae, ---, An]: M = [Ai:M, Ao: M, --- , An? M). 
In common arithmetic the residual satisfies the additional distributive laws 
Ill (Ai, Az, ---,An):M = (Ai: M, Ao: M, --- , An: M), 
IV M:[Ai, Az, ---, An] = (M:A1, M: Ae, --- , M:?A,). 


Since III and IV are easily proved by induction from the case n = 2, we shall 
discuss them here in the form 


(5.1) (A, B):M = (A:M, B:M), 
(5.2) M:[A, B] = (M:A, M:B). 


That III and IV need not hold in the abstract structure is shown by inter- 
preting = as the polynomial ideals of the ring K[x , 22, 23]. On taking M = 
(xi, 22, 23), A = (xi, 23) and B = (23, x3) we find that (A, B):M = (1), 
(A:M, B:M) = (aj, 23, 23). On the other hand, if M = (x22, 23), A = 
(xi, x3), B = (x3, 23), then M:[A, B] = (1), (M:A, M:B) = (a1, 22, 23). 

We now make the following assumption: 

PostuLaTEe C. If A divides B, there exists an element P such that A = B:P. 

The following consequences of C are needed in the discussion of (5.1) and 
(5.2) and serve to reveal its scope. 
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Lemma 5.1. If P and Q are any two elements of >, then (P, Q) = P:(P:Q). 

Proof. (P,Q) > P — (P, Q) = P:N by Postulate C. By Theorem 4.1 
and rule (4.3), (P,Q) = P:N = P:(P:(P:N)) = P:(P:(P, Q)) = P:(P:Q). 

Lemma 5.2. If Q > P, then Q = P:(P:Q). 

Proof. QDP—->Q = (P,Q) — Q = P:(P:Q) by Lemma 5.1. 

Lemma 5.3. If P D> M and Q > M, then M:P D M:Q only if Q D P. 

Proof. By Lemma 5.2, P > M,Q >M—P = M:(M:P),Q = M:(M:Q). 
Then by rule (4.51), M:P D> M:Q — M:(M:Q) > M:(M:P) — Q OP. 

This lemma is a limited converse of the first part of rule (4.51) which states 
that Q > P implies M:P D> M:Q. The direct converse is of course false. 

TueoreM 5.1. Lemma 5.1 and Postulate C are equivalent. 

Proof. Lemma 5.1 implies Lemma 5.2. And Lemma 5.2 is Postulate C 
with N = P:Q. 

Lemma 5.31. If P >MandQ>M, then M:P = M:Qif and only if Q = P. 

Proof. Q = P — M:P = M:Q by (4.51). M:P = M:Q —Q = P if 
P>M,Q > M by Lemma 5.3. 

Lemma 5.4. If M:N = M, then (M, N) = I. 

Proof. By Lemma 5.1 and rule (4.2), (M, N) = M:(M:N) = M:M =I. 

Lemma 5.4 shows that the distinction between “relatively prime” and 
“coprime”’ (Teilerfremd) elements, which must be made in the general theory 
(van der Waerden [2], Chapter XII, p. 30), vanishes if Postulate C is assumed. 

THEorREM 5.2. Lemma 5.3 and Postulate C are equivalent to one another. 

Proof. We have shown that Postulate C — Lemma 5.1 — Lemma 5.2 — 
Lemma 5.3 — Lemma 5.31; Lemma 5.2 — Postulate C. It suffices then to 
show that Lemma 5.31 — Lemma 5.2. By Theorem 4.1 and rule (4.21), 
P:(P, Q) = P:Q = P:(P:(P:Q)). Also (P, Q) D P, P:(P:Q) > P. Hence 
by Lemma 5.31, (P, Q) = P:(P:Q). This is Lemma 5.2. 


6. We shall now prove the important 
TuEoreM 6.1. If Postulate C holds, the structure = is arithmetic.’ 
Proof. It suffices to show that 


(i) (C, [A, B]) > [(C, A), (C, B)]; 


for we have trivially [(C, A), (C, B)] > (C, [A, B)). 

Assume Postulate C. Then by Lemma 5.1 and the first distributive law, 
(C, [A, B]) = [A, B]:{[A, B]:C} = [A, B]:{[A:C, B:C]} = [A, B]:M, where 
we have written M for [A:C, B:C]. Thus by the first distributive law, and 
Lemma 5.1, 

(C, [A, B]) = [A:M, B:M], 


[(C, A), (C, B)] = [A:(A:C), B:(B:C)]. 


Now A:C DM, B:C DM. Hence by rule (4.51), A:M D A:(A:C), 
B:M > B:(B:C), so that [A:M, B:M] > [A:(A:C), B:(B:C)], giving (i). 


® Stated by Garrett Birkhoff ((6], p. 619) for the special instance of the ideals of a 
commutative ring. We have made no assumption here that our structure is Dedekindian. 
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Postulate C need not hold in an arithmetic structure. For consider the 
numbers 1, 2 and 4 with both multiplication and cross-cut taken as L. C. M. 
and union as G.C.D._ In this arithmetic structure, 1:1 = 1:2 = 1:4 = 2:2= 
2:4 = 4:4 = 1; 2:1 = 2; 4:1 = 4:2 = 4. However, 4:(4:2) = 1, (4, 2) = 2. 
This contradicts Lemma 5.1. 

If = is a Boolean algebra and we interpret both multiplication and cross-cut 
as Boolean multiplication and union as Boolean addition, then A:B = A + B’ 
where B’ is the negative of B. Hence 


A:(A:B) = A:(A + B’) = A+ (A + B’)’ = A + A'B = A + B = (A, B), 


so that Postulate C is satisfied by Theorem 5.1. 
Lemma 6.1. If Postulate C holds, then for any three elements A, B and M of &, 


(M:A, M:B) = M:|(M, A), (M, B)}. 


Proof. Assume Postulate C. Then by Lemma 5.1, [(M, A), (M, B)| = 
[M:(M:A), M:(M:B)| = M:(M:A, M:B), by the first distributive law. 
Now (M:A, M:B) D M. Hence by Lemma 5.2, (M:A, M:B) = 
M:{M:(M:A, M:B)} = M:[(M, A), (M, B)]. 

THEOREM 6.2. If Postulate C holds, then for any three elements A, B and M 
of x 


(5.2) M:[A, B|] = (M:A, M:B). 

Proof. Assume Postulate C. Then by Theorem 6.1, [((M, A), (M, B)] = 
(M, [A, B]). Hence by Lemma 6.1, (M:A, M:B) = M:(M, [A, B]) = 
M:|{A, B], by rule (4.21). 

7. THEeoreM 7.1. If Postulate C holds, then 

(A:B, B:A) = I. 


Proof. Assume Postulate C. Then by rule (4.21), Theorem 6.2 and rule 
(4.2), (A:B, B:A) = ({A, B]:B, [A, B]:A) = [A, B]:[A, B] = J. 

TuHEeoreM 7.2. If Postulate C holds, then for any three elements A, B and M 
of = 
(5.1) (A, B):M = (A:M, B:M). 

Proof. (A, B) D A — (A, B):M D> A:M by (4.31). Hence (A, B):M D 
(A:M, B:M) and it suffices to show that 
(i) (A:M, B:M) > (A, B):M. 
(4.2) - A:M DA, B:M DB -— (A:M, B:M) 2D (A, B). Also (4.2) — 
(A, B):M > (A, B). Hence by Lemma 3.3, (i) follows if 
(ii) (A, B):}(A, B):M} D(A, B):}(A:M, B:M)}. 


By Lemma 5.1, the left side of (ii) is ((A, B), M) = ((A, M), (B, M)). By 
the first distributive law, the right side is [(A, B):(4:M), (A, B):(B:M)]. 
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Now by Lemma 5.1 and rule (4.4), (A, B):(A:M) = {A:(A:B)}:(A:M) = 
{A:(A:M)}:(A:B) = (A, M):(A:B). Similarly, we have (A, B):(B:M) = 
(B, M):(B:A). Thus the right side of (ii) equals [(A, M):(A:B), (B, M): 
(B:A)]|. By Theorem 7.1, (A:B, B:A) = I. Hence by Lemma 4.1, ((A, ™), 
(B, M)) > [(A, M):(A:B), (B, M):(B:A)]. This gives (ii). 


IV. Multiplicative properties of structures 


8. We shall deduce here two or three interesting consequences of the fourth 
distributive law. These are independent of Postulate C, which we no longer 
assume. 

THeEoREM 8.1. Jf the fourth distributive law holds, then multiplication is 
distributive with respect to cross-cut; that is, for any three elements A, B and M 
of = 
(8.1) M[A, B| = [MA, MB). 


Proof. Let N be any element of 2. Then by (5.2) and (4.4) N:[MA, MB] = 
(N:MA, N:MB) = ((N:M):A, (N:M):B) = (N:M):[A, B] = N:M[A, B]. 

On taking N = (MA, MB] and N = M[A, B] and applying rule (4.3) we see 
that M[A, B] > [MA, MB], |MA, MB] > M[A, B]. Hence (8.1) follows. 

Let Pi, Qi; Po, Ge; ---5Pn, Q, be n pairs of elements mutually residual 
with respect to a fixed element M, so that 


M:P; = Q:, M:Q; = P;, (i = 1, 2,---,m) 


Then the first and fourth distributive laws show that 
M:(Qi, Q2, —s » Q,) = [P;, P2, — » Pel, 


M:[P,, Po, +--+, Pal = (Qi, Qe, --- , Qn). 

Hence we have 

THEOREM 8.2. If the fourth distributive law holds and P; , Q; are mutually 
residual with respect to M for i = 1, 2,---,n, then [Pi, P2,---,P,] and 
(Q: , Qe, --- , Qn) are also mutually residual with respect to M. 

Pairs of mutually residual elements thus form a kind of structure. 

Lema 8.1. If multiplication is distributive with respect to cross-cut, then for 
any two elements A, B of = 


(8.2) AB = [A, B)(A, B). 


Proof. We always have [A, B] > AB D[A, B](A, B). Assume that (8.1) 
holds. Then [A, B](A, B) = [A(A, B), B(A, B)]. Since A(A, B) D AB, 
B(A, B) D> AB, we have [A, B](A, B) D AB. This gives (8.2). 

Formula (8.2) may be generalized as follows. Let S,, S:,--- , S, be any n 
elements of =, and let 


T, = Se-Ss3- --- -Sa, Te. = S,-S3- --- -Sa, +--+, Tn = Sy-Se- --- -Saa. 
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Then if formula (8.2) holds, we readily prove by induction that” 
(8.3) S,-S,- --- -S, = [S,, Se, ---, S)(T1, T2, --- , Tn). 


Formulas (8.2), (8.3) are thus consequences of the fourth distributive law. 


9. We conclude by giving some properties of structures which are semi- 
groups. The structure > is said to form a semi-group if 
(9.1) For any three elements A, B, C of 2, AB = AC implies B = C. 
This condition is easily seen to be equivalent to 
(9.2) For any three elements A, B, C of ©, AB > AC implies B DC. 
An equivalent condition in terms of residuation is 
(9.3) For any two elements M and N of =, MN:M = N. 


In a semi-group, a quotient Q = : need not exist if B D> A. But if it does 


exist, it is unique in the sense that A = QB is satisfied for only one value of Q. 
(9.1) may be restated as 
(9.4) For any two elements M and N of &, there is at most one element R such that 
MR = N. 
The equivalence of (9.1), (9.2), (9.3), (9.4) is independent of Postulate C. 
THEOREM 9.1. Let = be a structure in which multiplication is distributive with 
respect to cross-cut. Then if = is also a semi-group, > is an arithmetic structure." 
Proof. We are to show that (9.1) implies that 


(2.1) (A, [B, C]) = [(A, B), (A, C)]. 
By hypothesis and (8.1), (8.2) of the previous section 
[AB, AC] = A[B, C] = [A, [B, C]](A, [B, C]) = [[A, B], [A, C]](A, [B, C)). 
([A, B], [A, C]][(A, B), (A, ©)] 
= [[A, B](A, B), [A, B](A, C), [A, C](A, B), [A, C](A, C)] 
= [AB, AC, [A, B](A, C), [A, C](A, B)] 
= [[AB, AC], M], 
where 
M = |[A, B\(A, C), [A, C](A, B)). 
But [AB, AC] > [[AB, AC], M]. Hence 
[[A, B], [A, C]](A, [B, C]) > [[A, B], [A, CICA, B), (A, ©)). 


1° This formula is of great importance in common arithmetic. See, for example, Stieltjes 
[5], Chapter I, §§7-10. 

11 The converse of this theorem is of course false as is shown by the structure consisting 
of the finite ring of integers modulo 4. 
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Therefore by (9.2) and hypothesis 
(A, [B, C}) > [(A, B), (A, ©)). 
But we have trivially [(A, B), (A, C)] > (A, [B, C]). Hence (2.1) follows. 


On combining this result with Theorem 8.1, we obtain 
THEOREM 9.2. In a semi-group, the fourth distributive law for residuation is a 
sufficient condition that the structure be arithmetic. 
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ASYMPTOTIC RELATIONS FOR DERIVATIVES 
By R. P. Boas, Jr. 


1. Introduction. A well known theorem of Hardy and Littlewood states, in a 
form in which it is often quoted, that if f(x) is of class C’ on (0, ~), and if, as 
r— », f(r) = o(1) and f’(z) = O(x*), then f’(xz) = o(x"). It is a special 
case of a theorem in which the powers of x in the order relations are replaced 
by more general functions; and this in turn can be used to establish an extended 
theorem where from the order of f(x) and of f'” (x) (n = 2) one deduces the 
orders of the intermediate derivatives.’ Now, if we think of the hypothesis 
on f(x) in the original theorem as “zxf’(r) = O(1)”, it is a hypothesis on the 
order of the function resulting from applying a certain linear differential operator 
to f(x). The principal result of this note is the corresponding theorem when 

ir 
the operator z de 
operator, L; from the order of L[f(x)] and of f(x), the order of 


f(x) (k = 1, 2, so = 1) 


can be deduced. This result, and a preliminary theorem, overlap the results 
of Hardy and Littlewood, but neither include them nor are included by them. 
The full statement of our main theorem is somewhat complex; to illustrate it 
as simply as possible, a special case, sufficient for many applications, will be 
stated here. 

Let 


(1.1) Lif(x)] = YA’ f%C), 


is replaced by a certain more general, n-th order, linear 


where the A; are constants, A, # 0. Let f(x) be of class C" on (0, ~), and sup- 
pose that as x — ~, L{f(x)] < O(1). If f(z) = O(1), then f(x) = O(x); 
if f(x) = o(1), then f(x) = ofa) (k = 1,2, ---,n — 1). 

Examples of operators L[f(x)] which have the form (1.1) are x"f‘"’(x); the 
operator 


(—<g7" £"* ., a 
2! ae SO (k 2 2) 


Received June 23, 1937; presented to the American Mathematical Society, September, 
1937. 

1G. H. Hardy and J. E. Littlewood, Contributions to the arithmetic theory of series, 
Proceedings of the London Mathematical Society, (2), vol. 11 (1913), pp. 411-478; 417 ff. 

Reference should also be made to E. Landau, Uber einen Satz von Herrn Esclangon, 
Mathematische Annalen, vol. 102 (1929-30), pp. 177-188. Landau considers more general 
differential operators than we do, but his results are less general in other respects. 
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used by D. V. Widder to invert the Stieltjes transform;’ and the operator 
Ly,2| Lee f(x)]}, which inverts the iterated Stieltjes transform.’ In a later paper 
by D. V. Widder and the author, the results of this note will be applied to the 
theory of the iterated Stieltjes transform. 


2. Theorems with second derivatives. We shall denote by “7” and “|”, 
respectively, the classes of positive functions which are non-decreasing or non- 
increasing, 0 < x < x. We shall also consider the class of functions ¢(z), 
positive on 0 < x < «, and such that ¢g(cr)/g(x) is uniformly bounded for 
0 <2 < ~ and} Sc S 2; we shall call it “K”.* C” denotes the class of 
functions having continuous n-th derivatives on (0, «). With these notations, 
we can state 

THEOREM 1A. If g(x) and (x) satisfy any one of the conditions 

(a) o(z)e |, v(x)e |; 

(b) oz) e 7, W(x) FT; 

(c) z'g(z) eT, 2°*W(x) € Tt, with r = 0, and g(x) = O(2°Y(2)); 


then, for any f(x) «C’, as x — « all of the following statements are true: 


Ay. f(x) = O(e(z)) and f"(x) = O((z)) imply f(z) = Ole@)¥(2)"); 
Ag. f(z) = o(y(z)) and f"(x) = O(Y(z)) imply f(z) = ole@W(2)I'); 
As. f(z) = O(e(x)) and f(x) = o(Y(z)) imply f(z) = o(le(x)¥(2)|’), provided 


that ¢(x) = o(2°¥(xr)) is added to condition (c). 

THEoreM 1B. If o(x) ¢ K, ¥(r) € K, g(x) = O(2*Y(x)), and f(x) € C’, then, 
asr— &, 

By. f(z) = O(¢(z)) and f(x) < O(Y(x)) imply f(x) = Ole@)¥(@)!'); 

Be. f(x) = o(¢(x)) and f(x) < O(Y(2)) imply f(x) = o(fe(2)¥(2)); 

Bs. f(z) = O(¢(z)), f(z) < o(¥(x)), and g(x) = o(2*p(x)) imply f'(z) = 
o(le(x)¥(2))). 

Theorem 1A under hypotheses (b) and (c)’ and the special ease of Be, where 
v(x) and ¥(z) are powers of z,° are due to Hardy and Littlewood. 


2D. V. Widder, The Stieltjes transform, to appear in the Transactions of the American 
Mathematical Society. 

3D. V. Widder, The iterated Stieltjes transform, Proceedings of the National Academy of 
Sciences, vol. 23 (1937), pp. 242-244. 

4 Any constants a and A,0 <a<1< A < «, would do as well as } and 2 in the defini- 
tion of the class K. K contains, in particular, all those functions which J. Karamata 
calls ‘‘regularly increasing in the wide sense’. See J. Karamata, Sur wn mode de croissance 
régulitre des fonctions, Mathematica (Cluj), vol. 4 (1930), pp. 38-53, 194-195; the class of 
functions in question is defined on p. 40, and the theorems necessary to show that it is 
contained in K are on pp. 40, 45. 

5G. H. Hardy and J. E. Littlewood, op. cit., pp. 417, 425-426. 

°G. H. Hardy and J. E. Littlewood, Tauberian theorems concerning power series and 
Dirichlet’s series whose coefficients are positive, Proceedings of the London Mathematical 
Society, (2), vol. 13 (1914), pp. 174-191. On p. 188 they give, not precisely the theorem in 
question, but the corresponding theorem when z — 0+. 
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Theorem 1B does not include Theorem 1A; for example, let f(z) = ¢(2) = 
v(x) = e*; ¢(x) and (2) satisfy (b) of Theorem 1A, which is applicable to f(x); 
but there is no function @(x) « K for which f(x) = O(@(x)). Suppose, in fact, 
that e” < A@(r) (0 < x < x; Aa constant), and that @(cr)/@(r) = M (0 < 
xr<«;se>1). Then @(er) < M(x), 0(c’r) < M@(cr) S M’0(zx), and by induc- 
tion, 0(c"r) < M"@(x) (n = 1,2,---). But then e”” < A@(c"r) S AM"O(x); 
take x = 1; the result is contradictory for large n. Hence Theorem 1B cannot 
be applied to f(x). 

Theorem 1A does not include Theorem 1B, because the two-sided O-condi- 
tions of Theorem 1A cannot be replaced by one-sided O-conditions; this is 
shown by the example g(x) = e’, ¥(x) = 1, f(z) = —e*: here f(x) = O(¢(x)), 
f"(x) < 0 = o(¥(z)), but f(x) ¥ O(le(x)¥(x)}'). On the other hand, if the 
one-sided O-conditions of Theorem 1B are replaced by two-sided O-conditions, 
the resulting theorem is included in Theorem 1A (c).’ To establish this, we 
have only to show that it is possible to construct, given any 6(x) « K, a positive 
function u(x), such that for some r 2 0, x’u(x) € TF , and @(r) = O(u(z)), u(x) = 
O(6(x)). Since 6(cr)/O(x) <= M, uniformly for 0 < xr < ~,} Sec S 2, it 
follows (replacing x by cr) that @(cr)/@(r) < M’, and generally that 
6(c"x)/O(x) < M”", so that we have @(cr)/@(r) =s M"“ (2" S ¢ S 2"; 
n=1,2,---). Weassume, without loss of generality, that 0(0) = 0. Choos- 
ing r = Oso that M°2-"*' < 1, we define u(r) by 

u(x) = x ‘u.b. (00); 
O<t<z 
evidently 2'u(x) « fT ; and w(x) 2 O(x), so that 6(x) = O(u(x)). We can define, 
for each x, a t, (O S t. S 1), such that (t,7)'@(t-r) > 32°u(zx), 


u(x) — 20; 0(t. 2) 
O(x) 6(x) 


Let x — x. If we ean show that for x sufficiently large, and for some m 2 1, 
t, > 2°", we shall have u(r) /0(7) < 2M”, and hence u(x) = O(6(x)); this will 
complete the construction. But if the ¢, are not bounded from zero, we cannot 
have t, = 0 (x > 0), and there is a sequence 2; with ¢,, approaching zero. There 
are then integers n, = 1 with 2°" = t,, > 2°"*' for sufficiently large k, and 


u(a,) < mM"™*! 
(2%) adie 


This contradicts u(r) = A(x). 

There is no theorem corresponding to Theorem 1B with the rdéles of the 
equality and inequality signs interchanged; that is (taking for example B)), 
if g(r) e K, ¥(x) € K, o(x) = O(2*Y(x)), then f(x) < O(e(x)) and f’(x) = O(Y(z)) 
do not necessarily imply f’(7) = Ole(x)¥(2)}'). A little reflection shows that 
this state of affairs is to be expected; the reader will have no difficulty in con- 
structing examples to illustrate it. 


“t% 


? That this might be true was suggested to the author by the referee. 
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From the point of view of this note, Theorem 1B is more important than 
Theorem 1A, because it will be used in establishing our main theorem. For 
the sake of completeness, however, we shall give a proof of Theorem 1A under 
hypothesis (a) (this is the case not discussed by Hardy and Littlewood), and 
indicate how the same method could be used for the other cases. No originality 
is claimed for the proofs of Theorems 1A and 1B; the method of proof is an 
obvious modification of that by which the theorems are usually established 
when g(r) = 2“, (x) = 2S 

Let 6(x) be any function, nowhere zero, such that « + 6(x) > 0 for x > 0. 
Then Taylor’s theorem with remainder of second order, applied to f(x), can be 
written 
] 

, a 
(2.1) f'(x) = a(x) 


where 6 = 6(z, 6(x)) satisfies 0 < 6 < 1. 
Now, if e(x) is any positive function, (2.1) is valid with 6(7) = e(x)[e(x) /W(x)]}, 
and we have 


f(a) _ e(a) 
eal = eady(ay I + 3)! + 1I@) I + aris 


If we assume hypothesis (a) of Theorem 1A, 
e(x + 4(x)) S ¢(z), 
with a similar inequality for ¥(x), and (2.2) gives 
e(x)O(1) + O(1)/e(2), 
(2.3) feogtnp $ «OU + o(8)/e(2), 
\e(x)o(1) + O(1)/e(x), 


according as we consider A;, As, or Aj, respectively. To establish Aj, we 
take e(x) = 1; to establish As, we take for e(x) a function which is o(1) but of 
sufficiently slow decrease; to establish A;, we take for e(x) a function of suf- 


[f(x + 6(x)) — f(x)] — 26(x) f(x + 66(x)), 


(2.2) 1 f’'(x + 06(2x)) |. 


ficiently slow increase, with 1/e(z) = o(1). 
Theorem 1A under hypothesis (b) can be established in a similar way, but 
more care is needed. It is natural in this ease to take 6(x) = — e(z)[¢(x)/¥(z)]}, 


where ¢(z) is still a positive function, to be chosen as before; but such an argu- 
ment can be used only as x — & on the set E, of points where ¢(z)/¥(z) S 
(x/e(x))*, since on the complementary set, E2, 6(z) < — zx, and (2.1) is no 
longer valid. But as x — © on Ez, it is legitimate to suppose that f’(0) = 0, 
and the conclusions of the theorem can then be deduced from 


if(@| < [ “| fb) | dt, 


8 See, for example, E. Landau, Darstellung und Begriindung einiger neueren Ergebnisse der 
Funktionentheorie, 1929, p. 58, where a proof is given for the corresponding theorem when 
xz — 0+; the modifications necessary when x — ~ are trivial. 








se i st 
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and the fact that y(x) « |. Under hypothesis (c), the proof is simpler, since (c) 
includes an auxiliary hypothesis which in effect makes E, empty. 

We now establish Theorem 1B. We have g(x) S o(x)z*¥(x), where o(x) = 
o(1) for Bs, and (without loss of generality) we may take o(x) = 1 for B, 
and B,. Let ¢(x) be a positive function, «(z) < [4e(x)J*. Let a(z) = 
+e(2x)[e(x)/p(x)}'; then | 6(z) | < 3x, and (2.1) is valid; we may write (2.1) in 
the form ; 

— —1 /y(z) e(x) (e(z)\’ », 
(2.4) +f (x) = €(x) (2) [f(x + 6(x)) f(x)] + xy (e@) f (x + 05(x)), 
where 0 < @ < 1, and the sign taken in (2.4) is that of —é(x). Now, f’(x) s 
n(x)¥(x), where n(x) > 0, n(x) = O(1) for B; and Be, and n(x) = o(1) for B;. 
The function A(z) = ub. ss n(t) has the same properties, and n(x + @4(x)) S 


A(x). Then — 

os) FP) 5 _fle +) -f@) , AW + 0) 
[e(x)y(x)}! e(x) 9(2) 2y (x) 

Since ¥(x) eK, (x) « K, and | 6(x) | < 4, there is a constant B such that 

(2.6) ¥(x + 68(x))/Y(x) = B, o(x + 4(x))/e(z) S B; 

and hence 


f(x + 4(z)) | _ | f(x + 62) | o@ + 6@)) — B | f(x + (zx) | 
¢(x) 


g(x + 6(z)) ot) p(w + H(z)’ 
so that 
(2.7) i f(z + 4(z)) |/e(z) = (a), 
where 7(x) = O(1) or o(1), according as we consider B,; and B;, or Bz. Then 
(2.6) and (2.7) reduce (2.5) to 
F f'(x)le(a)W(x)J* < +(x)/e(z) + 4B e(x)A(2). 
For B,, we take ¢(z) = 4; for Be, we take e(z) = min [3, (r(z))']; for Bs, we 
take e(x) = min [(A(z))~, (50(x)) 4}. 


3. A definition and a lemma. It is convenient to have a name for the differ- 
ential operators which we shall consider. We introduce the following 
Derrinition. A linear differential operator 


(3.1) LU) = X postage) 
with 
(3.2) pi(x) = Do bisa’, 


where the b;; are constants, and bo, = 1, shall be called a generalized Euler operator.’ 


® The name ‘“‘generalized Euler operator’ is used because in the special case, mentioned 
in the introduction, when p,(xz) = bx", L[f(x)] = g(x) is what is known as an Euler differ- 
ential equation. 
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Lemma. If L[{f(x)] is a generalized Euler operator of order n, f(x) is of class 
Cc” (0 S$ z < «), anda is an integer, 0 < a S n — 1, thenaszr— ~, 


n—a—2 
[ (x = NLfldt = °F" "e+ 0) + LY f° WOR") 
(3.3) a 
+ Do Aj2' | f(QOW™) dt + O(2"), 
i=0 0 
where the A; are constants, and c is a constant ~ 0.° 


Let L[f(x)] be the operator adjoint to L[f(x)]. Then if g(x) is any function 
of class C", 


(3.4) [ gL [f®]dt = [ SOL[g@lat + PLfO, g@™1 iz, 
where 

(3.5) Li{g(t)] = > (—1)'[p.-Og OI"; 

(3.6) PIs, 1 = . (<1 fF" O lp ag Ol? 


If g(t) = (x — t)", and the differentiations are carried out in (3.5) and (3.6), 
we find 


[ (x — t)’L{f()dt = .. Ajax [tora ~ > Aie' 
0 i=0 


+ DL f° @Pisasle) + f° ?@(ex" + Pra), 
i—o 


, . . . 
where A; and A; denote various constants, and P;(t) is a polynomial of degree k 
at most (not necessarily the same at each appearance); from this (3.3) follows 
at once. The details are left to the reader. 





4. A theorem with generalized Euler operators. We shall consider a pair of 
positive functions 9(z) and ¢(x) satisfying what we shall call 

Conpitions A.’ 

(i) g(x) « K; 

(ii) 16(z) = O(¢(z)); 

(iii) [ 6(t) dt = O(¢(z)); 


0 


Ir 
(iv) g(t) dt = O(xg(xr)).”” 
0 

10 Actually, c = (—1)‘*a!. 

11 See, for example, E. L. Ince, Ordinary Differential Equations, 1927, pp. 123-124. 

#22 The author is indebted to the referee for the elimination of a redundancy in Condi- 
tions A. 

13 It is supposed throughout this section that z—> «. In particular, if @(z) > 0, and 
6(z) is (in Karamata’s terminology) regularly increasing in the wide sense, then @(z) and 
¢(z) = x6(x) satisfy Conditions A. The results needed for verifying this can be found 
in Karamata’s paper cited in footnote 4. 
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We now establish our main theorem. 

THEOREM 2. Let L[f(x)] be a generalized Euler operator of order n, and let 
f(x) of class C” on (0, ~) be a solution of the differential equation L{f(x)] = g(z). 
Let 0(x) and ¢(x) satisfy Conditions A. If,asx— =, 


(4.1) f(z) = O(6(z)), 
(4.2) ‘ g(t)dt < O(¢(z)), 
then 
(4.3) f?(x) = O(2” "¢(z)) 
for p = 1, 2, --- ,n — 2; and also for p = n — 1 of in addition 
(4.4) g(x) < O(6(z)). 

If also 
(4.5) f(z) = 0(6(x)) 
and 
(4.6) [ O(t)dt = x, 
then 
(4.7) f(z) = o(x ” “g(z)) 


for p = 1, 2, --- ,n — 2, and also for p = n — 1 tf (4.4) is satisfied. 

If the conclusion of the second part of the theorem is needed only for 1 S p S 
n — 3 (or n — 2 if (4.4) is satisfied), it is a direct consequence of the first part 
(even without (4.6)), by successive applications of Theorem 1B, since by Condi- 
tion A(ii), 0(z) = O(a "¢(x)), while (x + 1)‘y(x) ¢ K for any real number k 
if g(x) « K. 

Theorem 2 has content only when n 2 2 if (4.4) is satisfied, and only if 
n = 3 when (4.2) alone is satisfied. If n = 2, L[f(x)] = 2°f’(z), and 20(x) = 
g(x), Theorem 2 reduces to the special case of B; and B, of Theorem 1B where 
g(x) = 2’¥(x) (except that Conditions A(ii), (iii), (iv), and (4.6) are then 
redundant). If L{f(x)] = 2"f‘(x), o(z) = x0(r) = x‘, and the one-sided 
O-condition (4.4) is replaced by a two-sided O-condition, Theorem 2 is a special 
case of the extension of Theorem 1A given by Hardy and Littlewood (who use 
a more general function ¢(zx))."* 

If we take 0(r) = (x + 1)” log (x + 1), o(x) = [log (x + 1))’, we obtain an 
example illustrating the theorem when ¢(x) # 2x6(x); and the theorem with 
these functions has applications.” 


' Reference in footnote 5. 
18 It was used in the author’s Harvard thesis (unpublished). 
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To establish Theorem 2, we start from the familiar formula 


G,(x) = [@-or oa - mt [ dtm [re = “f at, [° g(t)dt (m>0). 


From Conditions A(ii) and (iv) we obtain, for p 2 


[ g(t? dt = o(» [ g(t) it) = O(x”*' y(z)); 
[ “ol dt = o( [ “ ptor"at) = O(2? oz); 


and it follows, by use of (4.2), that 
(4.9) G(x) < O(x"¢(x)) (m = 0,1, 2, ---). 
Now, by Condition A(iii), 


[ “a(t dt = O((2)), 


1/e(z) = oft / I ' ai at} = 0(1), 


(4.10) z* = O(2* o(z)) (k = 0). 
Since f(x) = O(@(x)), from Condition A(iii) and (4.8), we obtain 


(4.8) 





(4.11) ¥ Aca’ [ “FOU at = O(z" g(x). 


1=0 


We use (4.10) and (4.11) in (3.3), and, remembering that L[f(x)] = g(x), we 
obtain 


Giz) = 2" (ale + of) + DS f° WOR) 
(4.12) a 


+ O(x* o(z)) (lsasn-}). 


Suppose now that (4.3) has been established for 1 S p S 9,q Sn — 2. 
Then (4.12) fora = n — q — 1 gives 


(4.13) Gr—~a(z) = O(2" *'g(2)), 


and since G(x) = m(m — 1)G,-2(x) (m = 1; we set G(x) = g(x)), we have 
by (4.9) with m = n — q — 3, 


(4.14) Gr-a(z) < O(2" *“¢(z)) 


when g < n — 3, and also when q = n — 2 if (4.4) is satisfied. But since 
o(x) «eK, (a + 1)""¢(x) « K; and by part B, of Theorem 1B, (4.13) and 
(4.14) imply 


(4.15) Gi,_ga(z) = (n — q — 1)Gr-g-2(z) = O(2" *“g(z)). 








ve 


1). 
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By (4.12) fora = n — q — 2, and by (4.3), assumed established for 1 < p < q, 
(4.16) Gr—g2(z) = 2°f"(x)(e + o(1)) + O@" *“y(z)). 
Comparing (4.15) and (4.16), we have, since c ¥ 0, (4.3) for p = q + 1, provided 
only that 1 S$ q S n — 3, orl S q S n — 2, according as (4.4) is not or is 
satisfied. Hence the first part of Theorem 2 is true as soon as we verify (4.3) 
for p = 1. To do this, we have, from (4.12) with a = n — 1, 
Gra(z) = 2"f(x)(c + o(1)) + O(2" “¢(z)) 
= 0(z""¢(z)) | 
by use of (4.1) and Condition A(ii). Then by (4.9) with m = n — 3, or by 
(4.4) ifn = 2, 
Gns(x) < O(z"“¢(z)), 
and by Theorem 1B, 


Gn-2(x) = x"f'(x)(¢ + o(1)) + f(z)O(e"™) + O(z"“g(z)) 
= O(z"“g(z)), 


so that 

S'(z) = Of ¢(z)). 
This completes the proof of the first part of Theorem 2. The proof of the 
second part is similar. 


Since | 6(t) dt diverges, in place of (4.10) we have 
0 


(4.17) z* = o(z'y(z)) (k = 0). 
In place of (4.11), 
»» Aya’ [ “SWO(C™) dt = o(2* (2). 
Hence (3.3) gives us, in place of (4.12), 
n—a—2 
G(x) = xf" (ae + of) + DL f° WO") 

(4.18) 1=0 

+ o(z* ¢(x)) (lsasn-1). 


The remainder of the proof is exactly parallel to the proof of the first part of 
the theorem, part B: of Theorem 1B being used instead of part B,;. The details 
are left to the reader. 


5. Conclusion. We have stated our theorems with x — ~. It is clear that 
they hold, with obvious modifications, when z — 0+. The proofs are given 
most simply by modifying the reasoning directly, rather than by a change of 
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variable. The classes — and | are replaced by the classes of functions non- 
decreasing or non-increasing as x — 0+; that is, | becomes | and reciprocally. 
The class K is replaced by the class of functions g(x) with g(1/r) « K. Since it 
may not be quite evident what becomes of Theorem 2, we state the modified 
theorem in detail. 

THEOREM 3. Let 


i LU @)] = LX pases (2), 
V *  ) p@® = Ebi, be = 1. 


Let f(x) of class C" on0 < x < = bea solution of the differential equation L{f(x)] = 
g(x). Let 0(1/x) and ¢g(1/x) satisfy Conditions A. If, as x + 0+, 


f(x) = O(6(2)), 
[ g(t)dt < O(x"** g(z)), 


then 
f(x) = O@ ?"9(z)) 
for p = 1, 2, --- ,n — 2; and also for p = n — 1 tf in addition 


(5.1) g(x) < O(a "0(zx)). 
If also 
f(x) = o(6(z)) 
and 
[ a(t) dt = x, 
then 


f?' (x) = o(x?™ g(z)) 
for p = 1, 2, ---,n — 2, and also for p = n — 1 if (5.1) is satisfied. 


HarvVARD UNIVERSITY. 
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TAUBERIAN THEOREMS RELATED TO BOREL AND ABEL 
SUMMABILITY 


By Morris L. Ka.es 


The following work is divided into two sections. The first deals with a 
Tauberian theorem of summability methods related to Borel summability, and 
the second with a Tauberian theorem of summability methods related to Abel 
summability. 

In 1925 Robert Schmidt gave a proof of the following theorem [6]:' 

If a series is Abel summable to the value s, and if the partial sums s,, satisfy the 
condition 

r— 7n 


: n 
lim (s,, — s.) 20 whenever —— —-0 (m > n), 
- n 


then 


lim s, = 8. 
n—v0 
In the same year Schmidt gave a proof of an analogous theorem concerning 
Borel summability. This theorem states [7]: 
If a series is Borel summable to the value s, and if the partial sums s,, satisfy 
the condition 


, m 
lim (s,, — s,) 20 whenever —— = 0 (m > n), 
n 


then 
lim s, = 8. 


In the following two years Vijayaraghavan [10, 11] gave a new and more 
elementary proof for each of these theorems. 

It will be observed that in each of the preceding cases the method of sum- 
mability is a power series method, and that the condition imposed on the 
sequence of partial sums is of the following type: 

m—n 


(i) lim (s,, — s,) 2 0 whenever —0 (m > n), 
o(n) 


where ¢(n) is an increasing function of n which tends to © with n. 


Received June 30, 1937. The author is indebted to Professors J. D. Tamarkin and 
O. Szisz, who suggested this problem to him. 

! The numbers refer to the list of references at the end of this paper. For an extensive 
bibliography on the subject of Tauberian theorems see N. Wiener, Tauberian theorems, 
Annals of Mathematics, (2), vol. 33 (1932), pp. 1-100. 
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These facts suggest an interesting possibility of extension. It is natural to 
ask: corresponding to any increasing function ¢g(n) which tends to infinity 
with n, does there exist a power series method of summability such that every 
series which is summable by that method, and whose partial sums satisfy condi- 
tion (i), is convergent to the same value s? In particular we may take the case 
when ¢g(n) = n* (0 < a@ <1). [Cf. J. M. Hyslop, 2.] This is a special case of 
Theorem I which is proved in the first part of this paper. When a = 3, we 
obtain the case of Borel summability as a special case, but the case of Abel 
summability, for which a = 1, is not included in Theorem I. The latter case 
is the point of departure for Theorem II, which is proved in the second part of 
this paper. 

I 

By combining the methods of Vijayaraghavan and Valiron [9] I have been 
able to prove the following theorem which I now proceed to formulate. 

Let 


> g(nye *” 2" = F(x) (x > 0) 


be a power series with radius of convergence R. (2 may be finite or infinite.) 
Let the function G(x) satisfy the following conditions: 

I. G(x) has a second derivative G’’(x) which is positive and which tends mono- 
tonically to zero as x tends to infinity. 

II. There exists an increasing function ¥(x), which tends to infinity as r > «, 
such that 


Ga) _ eee el 
G(x) =1+o471} whenever |21—2| G(x) 


III. There exists a decreasing function H(z) ~ G’'(x) (« — ) such that 
2/H(x) has an inverse K(x) which has a continuous second derivative and 
satisfies for all large x relations of the form 

(i) AK(xr) < 2K'(xr) < BK(z), 

(ii) 2° | K"(x) | < CK(a), 
where A, B, C are positive constants. 
IV. lim ~ —— =o 


zo ] 
Finally, let the function g(x) be defined as follows: 
V. g(x) = x°L(x) > 0, 
where o is any real number and L(z) satisfies the condition 


lim L(x) 


lim Gy = 1 for every fixed \ > 0. 





saan 
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THeoreEM I. Let the functions G(x) and g(x) satisfy relations I-V inclusive. If 


lim re > sig(nde "2" = 8 (s finite), 
and if 
lim (s,, — s,) = 0 whenever (m — n)/G"(n) > 0 (m > n), 
then 


lim s, = 8. 
200 

For the cases when g(x) ~ 1 asx — ~, or g(x) = x°L(x) with « > 0, condition 
IV is not required. 

The proof of this theorem depends upon a number of lemmas which are 
analogous to results obtained by Valiron and Vijayaraghavan. Before turning 
to these lemmas, it will be useful to enumerate a few of the immediate conse- 
quences of conditions I-V which can be easily verified. 

(A) Condition IT implies that 


lim 2°G’"(x) = @. 
(B) Without any additional assumption, the function ¥(x) of condition II may 
be replaced by any function g(x) which tends to infinity as z — © and satisfies 
g(x) S ¥(2). 


(C) The series 
> 82" = F(z), Dd g(nye" x" = F(x) 
1 1 


have the same radius of convergence. 
(D,) If g(x) = x°L(x) with o > 0, or g(x) ~ 1 as x — «&, then there exists a 
positive constant K such that 


Gm”) -K if 


lsonsm. 
g(m) 


(D,) If g(x) = 2°L(x), where o is any real number, there exist positive con- 
stants K and a@ such that 


1”) - Km if 1sngm. 
g(m) 
(D.) If g(x) = x°L(x), where o is any real number, there exist positive con- 


stants K and a@ such that 


g(m) < K("") if lindsm. 
g(n) 


n 
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In proving C and D use is made of the following representation of the func- 
tion L(x) which is due to Karamata [8, p. 45]: 


(1) L(x) = c(x) exp [ t'e(t) dt, 
where 
(2) lim c(x) =c¢ > 0 
and 
(3) lim e(r) = 0. 
Also, from (1), (2), and (3) it follows that 
.  L(vr) _ 
(4) lim “a * 1 


uniformly for all \ in any finite interval not containing the origin. 
Let —& = [y + 1], where y = y(z) is the solution of the equation 
G'(y) = log z. 
We are now ready to prove 
LemMA 1. Let N; = N,(é) and Ne = No(é) be two positive integers which 
satisfy the relations 


(i) Ne-—-&=§E-— MN (N2 > &), 
(ii) lim (Nz — £)°G’(N2) = &. 
i-~2 
Then 
No—1 


° ] . at 

lim ., g(nje "zx" = 1. 
z—R F (xr) 2», 7 

Lemma 1 is an immediate consequence of Lemmas 1, and 1, which follow. 

LEMMA 1, . 


Ni 


. 1 Gin) nm 
lim > g(ne “ = 0. 


zk F(x) 4 


8 


Let M be any positive integer such that 1 S M < & — 1. Writing T,(2) 


for e “'"’x" and following the method of Valiron [9], we can easily show that 
(1) eg gts (0<psM-—1). 
To prove this, we first apply Taylor’s expansion and get 

(2) G(M — p) = GE -—1— p) + (M -— £4 IGE — 1 — p) 


(M —+1)_,, 
+ ; + @"(M,), 




















Ww. 


UM a ’ 
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where M — p < M, < — — 1 — p, and then make use of the facts that @’’(z) 
is a decreasing function of x, that G’(z) is an increasing function of z, and that 

log xz = G'(y) 2 G@(E — 1) 2 GE — 1 — p). 
Now let us consider the case where g(x) = 2°L(x) with o > 0, or g(r) ~ 1 
asx— «x. Then by D, 
. M ) 
(3) aed 
g(é — 1 — p) 
Combining (1) and (3) and placing M = N, we get 


< K. 


(4) g(Ni — p)Tx,-» < Ke iste ® 


OspsN,-1). 
g¢—-1—p)Tra> ~ ( . . ) 


Hence 


N,-1 


(5) > g(n)T,(z) = 2 g(N: — p)T,-p 


n=1 


< Ke etter ® b> g(t = t p)T+-1-p < Ke Meier ® | ae), 
p=0 

Since — < Nz, and therefore G’(£) = G’’(Ne), it follows from (i) and (ii) 
that 
(6) lim (NV, — & + 1)°G’() = @ 

§-2 

Combining (5) and (6) we have 
(7) lim i 52 > o(n)T (x) = 0. 


Now let us consider the case where g(x) = x°L(z) and @ is any real number. 
Then by D, positive constants K and a@ exist such that 


g(M oP p) . a ra 
<K(é—-—1—p)*<K 
(8) g(§ —1—p) « . : 
(lsM st-—1;p=0,1,2,---,M — 1). 
Combining (1) and (8) we have 
(9) q( M wat pP)T u_» 
g(é — l = p)T:- l—p 


Let 6 be a fixed positive number such that N = [(1 — 4)&] + 1. Then writing 


< Ee ee ene, 


(10) KG \4 > g(n)T, (zx) = ri D Yo)? + Me ) > g(n)T,, = I, + Ie, 


we obtain from (9) 


(11) ae 
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But by condition IV and A 


(12) fen PEP a"(E) — alogé = lim Fores - soit - 
Thus, combining (11) and (12), we get 
(13) lim J, = 0. 
z—R 
Since lim L(\x)/L(xz) = 1 uniformly for \ in any finite interval not con- 


taining the origin, it follows that, for sufficiently large £, if (1 — 5) <n <m < &, 
then 


g(n) n° L(n) 3 ae 
(a4) g(m) ~ m*L(m) ~ 2(, - :) ae 


Setting M = N, in (1) and combining with (14) we see that 
g(Mi — P)T w,-» 


— 1a pT, <RO OS P< M—U—O8. 

e —il—p 
Hence 

1 Sey? . 

I= peg YE 9Ni— p)Tu-» < K exp (HM — £ + D°G"@) 
b— 

(16) 1 —~ 4 1 , 2ner 

. F(z) > g(é sol Sen pP)T:-1~» <K exp [—3(M,1 = g + 1) G (é). 
From (16) we see that 
(17) lim Iz = 0. 

z—R 

Combining (13) and (17) gives 
(18) lim PG » 4 > g(n)T (x) = 


It will be observed that in proving Lemma 1, , condition IV was not required 
for the case where g(x) = x°L(x) with o > 0, or g(z) ~ 1. Since this is the 
only place where condition IV is used, it follows that condition IV is not re- 
quired for the theorem if g(x) = x2°L(x) with o > 0, or g(x) ~ 1. 

LEMMA |),. 


x 


. ] Gim) m 
lim |. g(mje™ 2” = 0. 
lim ea) 29 


If we apply the theorem of the mean twice, it is easy to show that 
(1) ‘fo (§ <n Sm). 


In particular it follows from (1) that 7,,./7,, 1; i.c., the terms 7',(x) are 


monotone decreasing with respect to n for n 2 


rer lA 








erm ae. 
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By D. there exist positive constants K and a@ such that 


(2) vm) < K(") (lsnzswm). 
Hence 

. g(m)T,, ° = | ~(m—n)(n—8)G""(n) 

a : < _- : 

(3) g(n)T, x (” - 

Thus 


> g(m)T. _ g(n)T, > g(m)T », 


(4) KG 2) w=n F(x) =n g(n)T,, 
Kg(n)T,, = =) (m—n) (n—§)G"'(n) 
<q) (“ ; 
But 
- m Pes (m—n)(n—B)G'"(n) & . (m = n)* ,—(m—n) (n—8) GE" (n) 
x (“) , = aD ia ne \ 
(5) 
1 
salve ar + (n— smart} 
Combining (4) and (5) we have 
KK.g(n)T, l 
) Fa >> Hate < “He {+ Tau BG) yi —aerah 
Now by (2) we have, if § S n S m, 
7” g(n)T, 1 ay 
@) g(m)T,, - K (2 


Let N = [(1 + 4)é], where 6 is a fixed positive number such that 0 < 6 < 1, 
and let M be any positive integer such that & < M < N. Then 


¥ a(n). = = (Mu OT S gate SL (77) 


(8) n=t QJ g(M)T n= K M 
> ool (M — 8) ( 3 ;) 
Thus 
g(M)T “g(n)T, K(1 +68)" | K(i +6)" 
9 ‘ ; 
” F(z) “ = F(x) M-e# - =& 


Let K°K,(1 + 6)" = K,. Thenif N < Nz, we set n = N in (6) and M =N 
in (9), and combining the two we get 


1 «x 
si F(a) x. mT, < eG = g(m) T 


, N <Q). 
" Kit y —€ . (N —£)?@’""(N) + {(N — seeprenyyr} (Vs 
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If Ne S N, we may set M = Nz in (9) and n = Ne in (6), and combining we 


obtain 


1 1 
F(a 3 GT. < Ms iw: + Ww pe"Wy 


1 
(Ne — &)?G'(N2) ji 


(11) 


oo \ (N2 SN). 


Since by A, lim 2°G’’(xz) = «, it follows that 


1 . 1 
2 n < = 0 
(12) ane (N — §)*?G”(N) baw &N?G’(N) ; 


Thus from (12) and condition (ii) of Lemma 1, it follows that the right sides 
of (10) and (11) tend to zero as § — «x. Hence we conclude 


(13) lim rG = g(m) T, = 0. 


Lemma 2. U’nder the conditions of Lemma 1, 


lim vG (N2) >> (m — No)g(m)e” 2” = 0. 
z—R F(x) m=Ne 


By (3) in Lemma 1 we have 


re ) > (m — n)g(m)Tn = g(n)T'n > (m — n) Im Tm 


_ F(z) mn g(n)T 
(1) < xo Dim — n) ("J ae 
< KK.g(n)T, l 1 
F(z) {(n—-BG"(n)}2 + (n—8"G"(n)ef 
Let N be defined as in Lemma 1,,. Then if N < Ne, we have 
vee > (m — N2)g(m)T,, S Wr ni > (m — N)g(m)Tn 
(2) —e m=N 


and if No = N 


VG"(N2) he (m = N2)g(m)T 
(3) I (x) m=No 


, I 1 
< Kil oy — $)2G""(N2) + , > 





" ] 1 
< Ki) oy 7 §)?G""(N) + \(N pas st 


(No — &)? rengrp 








nr: 
ase nn 
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From (2) and (3) we conclude, as in Lemma 1,, that 


2 
(4) lim - ve (Ns) dX“ (m — N2)g(m)T» = 0. 
— Ss ch 

The following lemma concerning the slowly decreasing sequence {s,} can be 
proved in precisely the same manner as the analogous lemma proved by Vija- 
yaraghavan [11, p. 319, Lemma ¢]: 

Lemma 3. If lim (8, — s,) 2 O whenever (m — n)V/G"(n) > 0 (m > n), 
then to every positive number c there corresponds a positive number k = k(c) such 
that 


8m — 8 > —{k(m — n)V/G “(n) + ce}. 


Lemma 4. Under the conditions of Theorem 1, if 


2 


lim Zz. srg(nde "2" = 8 (s finite), 
z>k FG -» 
and if lim (8m — 8) 2 0 whenever (m — n)V/G"(n) > 0 (m > n), then 


= O(1). 


With the aid of Lemmas 1, 2, and 3 this lemma can be proved in the same 
way as Lemma 1 of Vijayaraghavan’s paper on Borel summability [11]. 

Lemma 5. Under the conditions of Theorem I, if |A,} be a bounded sequence 
of numbers, and if 


lim > A,g(n)e “"'x" = A, 


z—R FG ) 
then 
lim FG y An , ae 


Let 6 be a positive number such that 0 <6 < 1. Let Ny, = [(1 — d)E] and 
N. = [(1 + 4)é]. The conditions of Lemma 1 are satisfied by these defini- 
tions of N, and Ne. Hence 


(1) lim cy a g(ne = 1, 
and 
(2) lim pc, ) > g(nye Gin) r” = = lim PG 2 : g(nye Gin). n 


Since the sequence {A,} is bounded, it follows that 


(3) lim ic, x A,g(nye Ox" = lim pc, >> A,g(nje "2" = 
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and 


N2-1 


4 li A,g(nye ° 2" = A, 
( ) om i ) 2», g(nye 
As was noted under D, we have 


. Lira) 
(5) me TF 


uniformly for all \ in any finite interval not containing the origin. Now if 
Ni+1s nS Nz —1 then 


= 1 


(6) (1 — de Sn S (1 + 4)E, 

and therefore by (5) 

 1-&)< a <14e2) (N41 S08 Ne—1;e(2) 0,28). 
If we let 6, = 6fore 2 O0and 6, = —é fore < 0, then 

(8) (1 — 6)" < (2) < (1 + 5)". 

Combining (7) and (8) we get 

(9) (1 — ea) — 4)" < ate) < (1 + e(2))(1 + 84)". 


Since the A, are bounded and the summability methods under consideration 
are regular, there is no loss of generality in assuming that the A, are positive. 
Hence assuming that A, > 0 we get from (9) 


po a g(é) of iy On), n 1 my —G (> " 
ra (1 — 6,)"(1 — e(x)) F(a) 2» A,e < F@) », A,g(ne 
No-l 
< (1 + 4)" + ea) S a yee. 
Hence 


g(é) my Gin) 1 o g(t) - G(s), n 
11) (1-6, A. <A <(1+44,)’ lim % Pe 
an ¢ ” fim 52) 4°" =? (1 + 80) Oe (a) et 


If in (1) we place g(n) = 1, then F(x) = F(x) and we have 


No-l : 
(12) lim e Gi” = I, 
If now we place A, = 1 in (11), then A = 1, and combining with (12) we get 


(13) (1 — 6,)° lim an <islim(i+s er. 
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Since 6 is arbitrary, we may set 6, = 0 in (13) and thus conclude that 
ae 
(14) 9 _—— xz— R). 
F(a) ~ F@) ities 
Substituting this result in (11) we have 


No-1 


o 1 —G(n) nm 
— 6, Ane 
C~ 6 Tin re ee 


(15) 1 N2-1 
< A < 1 5, ¢ lim A, p Gin) » 
= = ( ° z—R F,(z) p>) Ane , 


If in (3) we set g(n) 1, then F(x) = F,(x) and we have 


Il 


p OCn) _ —Gi(n) x" 
= im py Ase 2 = lim pty Ae 
Combining (15) and (16) we have 
17 Oat y ee  & A, —G(n) on < 1 bi o lim Ane —G(n) x". 
(17) ( 8) Him poy es" SAS (1+ ) im ty D 
And since we may place 6, = 0 in (17), we conclude that 

ea x" a 

(18) lim pa Do Ane =A. 


Lemma 6. Let {A,} be a bounded sequence of numbers, |A,| < K, and 
suppose that 


lim - > A,e 72" = A. 


zk FG ) 4 
Let the function A(x) be defined as follows: 
(i A(x) = Ajay + (x — [2] (Apes. — Apes) (x 20), 
A(z) = 0 (x < 0). 


Finally, let H(x) ~ G(x) asx — ~%. Then 


lim WV “ox H(z) [ae + ze" at = A. 


Since H(x) ~ G’’(x), we have 
(1) H(x) = G(x) + e(x)@"(z) (e(z) > 0,2 > ~). 
By B, we may assume that the function ¥(z) of II satisfies the conditions: 
1 
(2) ¥(z) aod th 


1 
a 2) = Laney} 
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aszr— «x. Let 


4 ar ee ae v(é) | 
” meme Mn L/D 
From (4) and II we have 
(5) VG"(N2) ~ VG") (> &). 
Hence 
. OS =O OTS « =. 
We may then apply Lemma 5 and get 
a tf all 
By (16) in Lemma 5, we have 
Ne2-1 

8 li rr a hh, 
®) pal Fie yee Ane , 
Now 

TAs) ,~G(n)+G(§) +(n—§) log z 
” Taz) ~ : 


and by Taylor’s theorem 


(n 


(10) G(n) = G&) + (n — HG"(E) + Y a, ) €S& Snornsé, S28). 


Combining (9) and (10) we have 
(11) T. _ Fe re Oe ae Fe) 


Now if N; S n S No, then 


(12) lf —é| s in-e)sm—e=[ 4/28 | 


Hence by condition II, 


2) = Q” 2} 
(13) G'"(é,) = @(é) +04 we) f° 
Hence 
_ nn _ ye oe} 
(14) H(é) — G’'(é,) = p(€)G"(&) + of We) I 


Combining (14) and (2) we see that 


orgy = AO} 
(15) H(&) — @'(é,) oe 

















TAUBERIAN THEOREMS RELATED TO BOREL AND ABEL SUMMABILITY 659 


Hence 


(16) l(n— O{H® —G@E,)} | S$ (Me — pf FOl — 


uniformly for all n such that Ni; < n S Nz. Now, log x = G'(y), where 
& = [y+ 1]. Hence, since G’(z) is an increasing function of z, 

(17) 0 < G(é) — logx = G’(é) — G’(E — 1). 

Hence, if N,; Sn Nz, 


|(n — &){log x — G’(é)} | S (Ne — HIG’ — GE — 1)} 
Ss (N2 — HG" — 1) 


VO ang — 1) ~ a/@G" 


Combining (18) with (3) we see that 


(18) 


(19) | (n — £){log x — G’(é)} | = o(1) 
uniformly for all n such that N; S n S Nz. From (11), (16), and (19) we get 
(20) Ta = {1 + en(€)} Tye OO 


(where lim ¢,(¢) = 0 uniformly for all n such that N; S n S Nz). From (20) 
S ead) 


we conclude 


lim -2 +> A, T,(z) lim T(z) ¥ A, {1+ en(t)}eo" £)2(8) 
z—R F,(z) Ni+1 z—R F,(z) n=N +1 


(21) 


lim T,(z) >i Ape 888 


zn Fy(x) syv1 


In particular, if we place A, = 1, then A = 1, and we get from (21) 


(22) lim Ze) b a Grows = 1. 


z—R Fy (x) N)+1 


Now it is easy to show that 


N2-1 o — 

—H(n—) 2H (8) —}t2H (8) 2r 
(23) _ es e [ e H© 
Thus, we see that 

T(z) , /H() 

i Fa) ~ Vor 
And combining with (21) we have 
(25) lim H(é) +> Aye Paw a A. 


g-+0 7 Ny+1 
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From (25) it is possible to conclude by the sort of argument used by Hardy and 
Littlewood that [1, p. 39, Lemma 2.13] 


(26) lim Ve / Apne "dt = A. 
zr T —2 


And finally, by an argument which is the precise analogue of that given by 
Vijayaraghavan, we can conclude from (26) that [11, p. 322, Lemma 2] 


z—2 


(27) lim a&) [. 1(¢+ xe bePats) de ae A. 


We now introduce the following lemma which was proved by Valiron [9, 
Section 11, p. 278}: 
Lemma. Let K(x) be an increasing function of x which satisfies the conditions 


(i) lim K(z) = ©, 

r+. r 
(ii) AK(x) < rK'(x) < BK(z), 
(iii) | aK’ (x) | < CK(z) 


for positive A, B, C and all large x. Let f(x) be a bounded function of x, and 
suppose that 


l * : 
lim / f(t + K(a))e "dt = 8; 
rr J 2 


z—x 


then 


a aa 
, a " -at?/ 
lim 4/ —- | f(t + K(z)e"" "dt = s 
—) Tr = 
for everya 2 1. 

From this lemma we obtain very easily 

Lemma 7. If A(x) ts a bounded function of x, and if 


; He) [° en) 
lim VV 5 [ia + xe dt = A, 


E ied) 


then 


_ > aH(z) [* 4 4 ge" ay = A 


zs 


for every a = 1, where H(x) is the function defined in condition III. 

To prove this, we observe that the function K(x) of condition III satisfies 
the conditions of Valiron’s lemma. Thus, if we replace the parameter x by 
2/H(2x) in this lemma, we get Lemma 7. 

Vijayaraghavan has proved the following [11, p. 324, Lemma 4}: 








ies 
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Lemma. If 6 be any positive constant and if a — =~, then 


a rérh ‘ 
I, = 4/ — ee dt 1 
Wwe bz! 
uniformly in x,asa—> ~. 


If in this lemma we make the substitution z = 1/H(y), it takes the form of 
Lemma 8. If 6 ts any positive constant, then 


a(H(y))-4 
I, = V all (y) ew) as 
7 —5(H(y))-4 


uniformly in y, asa— ~, 

Lemmas 1-8 furnish us with complete analogues of the lemmas which Vijaya- 
raghavan uses in proving the Tauberian theorem of Borel summability. The 
proof of Theorem I can now be completed in the same manner as that used by 
Vijayaraghavan. 

II 

As was stated at the beginning of this paper Theorem I contains Borel 
summability as a special case, but not Abel summability. We turn now to a 
theorem which is a generalization of the Tauberian theorem of Abel sum- 
mability. 

TueoreM II. Let (x) be an increasing function of “regular growth’’, which 
tends to infinity with x; 


a » L(vz) 

= 2°L > 0: ~— 

(x) = x* L(x) (« > 0; lim L(x) 

Let B(x) be bounded in every finite interval. Let the integrals 


[ a d®(u), [rev at [Be ama} 


exist as Stieltjes integrals. Then if 


; 1 — ‘ , 
lim rd + a) 8) I e a{ [ Blu) aw} = B (B finite), 


=1 for every > 0), 


and if 


lim (B(y) — B(x)) = 9, whenever oe —1 (yy2r->»), 
then 
lim B(x) = B. 


z—2 


Lemma 1. If 6(N)/#(x) — 0, then 


1 ~ —tiz 
a e '* d&(t) > 0. 
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For 
1 I “eae = 1 [ * ao) 
a) (x) Ja (x) 
a #(N) — 2a) _, 0 
#(z) 


by hypothesis. 
Lemma 2. If &(M)/®(x) — ~, then 


1 = —tiz 
= |, e det) > 0. 


From the representation 


(1) Lis) = oa)", 
it follows that 
(2) L(x) = o(z‘) 


for every fixed « > 0, and a positive constant K exists such that if ¢ is a fixed 
positive number, then for all large zx and M > z we have 


o(M) _ e(M) freee of” sti _(M\*** 
@ Fa ~ a) ae 


From (3) we conclude that if (M)/#(z) — , then 


(4) ae 
zr 
From (2) we have 
(5) (x) = 2° L(x) = o(x***) (x — @). 


Integrating by parts we have 


—tle P(t) els 1 = —tlz 
ay |, « de(t) = |, + — —e  @ 


#(z) ° 
an —Miz #(t) evtlz 
Foe + fee") 
Substituting (3) in (6) we get 


#(z) 5 de  d@(t) < x(™ ye —Mie £( 


(6) 


(7) 








Ae 





ed 


(:) 


™ du, 
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and since, by (4), M/z — ~, we conclude from (7) that 


1 w —tiz #(M) 
(8) wo |, e dv(t) + 0 (3 — ), 


Lemma 3. If &(N)/#(x) — 0 and 6(M)/®(r) — ~, then 


1 M as 
ee * db . 
rd + ats | ipa 
It has been shown by Karamata [4, p. 296, Theorem I; p. 298, footnote 9] 
that if the function (z) is a function of regular growth which tends to infinity 
with z, then 


(1) lim 5a IHG [- 


If we combine (1) with Lemmas 1 and 2, Lemma 3 follows directly. 
Lemma 4. If #(M)/®(x) > ~, then 


1 fo (t) —t/z 

35 |. log (eh. de(t) — 0. 
Be &(t) \ we 1 f° 0 us 
as |. log (aan "ae <a | am? tt 


(t) e *d 
“al Haj’ | O. 


~"= dp(t) = 1. 


But for all large x, 


a). .(t\"* 
@) a2) < x(‘) 


and since M/x — ~, it follows that there exists a positive number 6 satisfying 
0 < 6 < 1 such that, ift = M, 


(3) K(‘) <7". 
zr 
Substituting (2) and (3) in (1) we get 
ex P(t) | we. 1 —(1-8)tlz 
(4) Ha) i, log ‘a O\. dP(t) < aM |v € d®(t). 


Integrating the right side of (4) by parts, we have 


1 “= (t) eos 1-6 [°O(t) ~a-sez 
. san |, one = Se | + Sa ee 


—(1—8) M/z _ » Oz) [OO -c-ouse ft 
= +O Dea Ju (2) ° a(*) 
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Since M/z — «, and #(M)/#(x) — ~, it follows from (5) in precisely the same 
manner as in Lemma 2 that the right-hand side of (5) tends to zero. Hence 
we see that 


ee s. (0) | sie a9 #(M) 
(6) #(z) I log ‘eh d&(t) > 0 (Go > ), 


By an argument similar to that used by Vijayaraghavan (10, p. 114, Lemma 6], 
t 


if we make use of the fact that B(u) d®(u) exists as a Stieltjes integral, we 


can prove 
Lemna 5. If lim (B(y) — B(x)) 2 0 whenever ®(y)/#(x) —- 1 (y 2r— &), 
then corresponding to any positive number c there exists a positive constant k = k(c) 


such that 
B(y) — B(x) > -{k log eu + ch 


for all large x and y 2 z. 

Making use of Lemmas 1-5, which correspond to lemmas in Vijayaraghavan’s 
paper on Abel summability [10], we can by an argument similar to Vijayaragha- 
van’s prove 

Lemma 6. Under the conditions of Theorem II, if 





ae) [ g™ at [" Be amu} = O(1) (rx «), 
and if 
lim (B(y) — B(z)) 20 whenever oe +1 y22—>2), 
then 


B(z) = O(1) asx — ~. 


From Lemma 6 it follows that the function B(z) of Theorem II is bounded. 
Hence there is no loss of generality in assuming that B(x) is positive. As- 
suming then that B(z) is positive, we write 


t 
(1) Aw) = [Bo ao(e. 
Since $(u) is an increasing function of u and B(z) is positive, it follows that 
A(t) is an increasing function of t. Thus the hypothesis of Theorem II may 


be put in the form 


(2) [oetaiaos ~ 10 + oe) —t 





where (rz) = x“L(x) (a 2 0), and A(t) is an increasing function of t. 
From (2) it follows by a theorem of Karamata that [5, p. 30, Theorem I] 


(3) A(x) ~ ®(z) (x — ~), 





at 
ay 
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i.€., 


, 1 - 
4 lim Blu) d&(u) = B. 
(4) tim ty |, Bled dow 
The following lemma now enables us to complete the proof of the theorem. 
Lemma 7. If G(x) is an increasing function of x which tends to infinity with x 
and which satisfies the condition 


. i Gey = Le Gay 
if 
lim G3 [ Ru) dG(u) = R (R finite) ; 
and if 
lim (R(y) — R(x) = 0 whenever Ge) ae Cebus 
then 


lim R(x) = R. 
This lemma is a generalization of a lemma proved by Szasz [8, p. 337, (18’)] 
and can be proved in the same way. There is merely one point in the proof 
which requires special attention. Following Sz4sz we set up the identity 


R(x) {G(y) -- G(2)} = [ ” R(x) dG) = [ ” RO dG 
(1) . ; : 
. I RW) AG) — | (RQ) — R(x) dG. 


To make the analogy complete, we should have to choose y so that G(y) = 
(1 + 6)G(x), where 6 is a positive number. Since G(z) is not necessarily a 
continuous function of z, it is not possible in general so to choose y. However, 
it is sufficient for the purpose of this proof if G(y) ~ (1 + 4)G(z) asx — &. 
Now since G(x) tends monotonically to infinity, it is clear that y = y(x) can 
be found so that 


(2) Gy — 0) S (1 + 4)G(xz) S G(y + 0). 


Combining (2) with (i) we see that G(y) ~ (1 + 6)G(z). The remainder of 
the proof of this lemma can be carried out in precisely the same manner as in 
Szasz’s proof. 

It is easy to show that condition (i) of Lemma 7 is satisfied by the function 
(x), since (x) is of regular growth. Hence, since 


lim xD [ B(u) d&(u) = B 


Io 








1. 


10. 


11. 
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and lim (B(y) — B(x)) = 0 whenever #(y)/#(z) > 1 (y = x — ~), it follows 
from Lemma 7 that lim B(r) = B. 
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ASYMPTOTIC EXPRESSIONS FOR THE ZEROS OF GENERALIZED 
LAGUERRE POLYNOMIALS AND WEBER FUNCTIONS 


By ViviAN EBERLE SPENCER 


Introduction. It is the purpose of this paper to apply the close relationship 
between Hermite and Laguerre polynomials to find asymptotic expressions for 
the zeros {,2in} of the generalized Laguerre polynomial L,(z, a). The results 
obtained for the largest zero ,z,, are, we believe, new; for the other zeros the 
expressions are essentially equivalent to those obtained by Winston;' the method 
of procedure, however, in every case is new and fruitful. Hermite functions 
h(x, n) are the special case of Weber’s parabolic cylinder functions w(z, n) 
obtained when the boundary conditions w(+ «, n) = O(r"e*’) are imposed. 
The argument is simplified and the order of some of the results improved by the 
introduction of the latter functions for non-integral n. Hence we are led to a 
discussion of the zeros {.2in} of w(z, n). By an elementary application of 
Sturm’s theory Milne’s’ properties of the zeros {,.2in} of the standard solution 
are obtained, and similar properties for the zeros of the solution converging to 
zero as x approaches minus infinity are developed. The application of the 
known asymptotic expressions for the zeros of Hermite polynomials is shown 
to give directly bounds and asymptotic expressions for the zeros of Weber 
functions for n an arbitrary positive number. A sequence of Weber functions 
w(z, a, n) is associated with every sequence of Laguerre functions l(z, a, n), 
and definite separation and asymptotic relations are obtained between the zeros 
of w(z, a, n) and U(r, a, n). As a consequence of these relations asymptotic 
expressions for all zeros of L,(z, «), for any a > 0, are obtainable immediately 
from any asymptotic expression for the zeros of the Hermite polynomial H,(z), 
or for the zeros of the Laguerre polynomial proper L,(z, 1). Neumann’s® 
bounds for the zeros of L,(z, 1) and Zernike’s* asymptotic expression for the 
largest zero of H,(x) are then applied to L,(z, a). 


1. An asymptotic expression for .z,, for } < a < 3. We consider the Her- 
mite and Laguerre polynomials satisfying respectively the differential equations 


(1) H(z) — 2cH,(z) + 2nH,(z) = 0, 
(2) aL’ (x, a) + (a — x) Li (zx, a) + nb,(z, a) = 0 (a > 0). 


Received January 4, 1937; in revised form, July 8, 1937. 
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The polynomials H,(x) and L,(z, @) satisfy the relations 


(3) H>,(x) = L,(x’, 3); Honii(z) = 2L,(2", §). 
Zernike’s asymptotic expression for the largest zero y2x,, of H,(x) is° 
w u&nn = (2n + 1)'? — 1.8557571 (2n + 1) "* — 0.3443834 (2n + 1) ** 


— 0.168715 (2n + 1)°*°* — 0.151965 (2n + 1)°* + Of (2n + 1) 7"). 


Applying (3), we obtain immediately an asymptotic expression for the largest 
ZeTO 1Xn», Of L(x, a) for a = 4, 3: 


((a = 4) pan, = 4n + 1 — 3.7115142 (4n + 1)"* + 2.7550676 (4n + 1)" 
+ 0.940754 (4n + 1)' + 0.440858 (4n + 1) °° 

+ Of(4n + 1)“*} = A(n, }), 

3) 1tan = 4n + 3 — 3.7115142 (4n + 3)"* + 2.7550676 (4n + 3)? 
+ 0.940754 (4n + 3)' + 0.440858 (4n + 3)~°° 

( + Of(4n + 3) °°} = A(n, #) 


Markoff’s® theorem, applied to Laguerre polynomials, gives 


- 
R 
II 


(6) 0 uZin >0 (i = 1,2,---,n). 
da 


Then (5) and (6) yield 


(7) | (a > 3) inn > A(n, }), 
4 

(0 < a < $) LXnn < A(n, 3). 
Hence 


(8) (§ <a@ <3) itn = 4n + 2a — 3.7115142 (4n + 2a)'* + O(1) 
(O(1) < 2). 


Note that (8) includes the case of Laguerre polynomials proper, corresponding 
toa = 1.’ 


A more delicate analysis is necessary to extend these results to any a > 0. 


5 The remainder term does not appear explicitly in Zernike’s expression, but is implied 
in his argument since he proves that an asymptotic expansion for yz,, in powers of 7 = 
2n + 1 exists. 

® J. Shohat, Théorie générale des polynomes orthogonaux de Tchebichef, Mémorial des 

Sciences Math., Fasc. 66, p. 39. 
7 By a method based on Zernike’s argument for Hermite polynomials, but not utilizing 
the intimate relation between h and l apparent in (10) and (21) below, W. Hahn, Die Null- 
stellen der Laguerreschen und Hermiteschen Polynome, Dissertation, Berlin, 1933, was able 
to show that 4n + 2a — ¢,(4n + 2a)! < ponn < 4n + 2a — c2(4n + 2a)! (a > 0; ¢1, ¢, > 0), 
for n sufficiently large. (8), which has been obtained from Markoff’s theorem with scarcely 
any argument, is, for } <a < 4, a better result. 
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2. The zeros of Weber’s parabolic cylinder functions. The Weber equation 
may be written in the form* 


(9) w’ + (Qn +1 — 2’)w = 0. 


When the boundary conditions w(+ «, n) = O(r"e*’) are imposed, the only 
solutions of (9) are the Hermite functions {A(z, v)}, A(x, v) = H,(r)e™”’, satis- 
fying the differential equation 


(10) h’+(&+1—2),h=0 (» = 0,1,2,---). 


To show this, assume that there exists for n # v a solution of (9) satisfying 
these boundary conditions. Multiply (9) by A, (10) by w, subtract and inte- 
grate between any two limits a and b 


b b 
(11) (wh — wh’)' = Ay — ” | wh dz. 


co} 


Then letting (a,b) = (— «, «), we have / wh dx = 0, a result which contra- 


dicts the completeness of the set {A(z, v)}. 
Write (9) and (10) in the form 


(12) w’ — o(z, n)w = 0, o(z, n) 
(13) h” — p(x, v)h = 0, p(x, v) 


Then p(v — 1) > a(n) > p(v) forvy —1 <n<v. Hence, as a consequence of 
Sturm’s fundamental theorem, the zeros |2in} of w(x, n) separate the zeros 
{w%iv—1} Of A(x, vy — 1) and are separated by the zeros {x2} of A(z, v). It 
follows that w(x, n) has at least » — 2 and at most v + 1 zeros. 

Consider (11) with A(z, v) replaced by h(z, » — 1), and (a, b) = (atv, k) = 
(7, k), where k is an arbitrary constant to be properly chosen. We have 


a(n) = x — 2n — 1, 


p(y) = z* — 2 — 1. 


I 
II 


k 
(14) w'(k)h(k) — w(k)h'(k) + w(r)h'(r) = 2(v — 1 — ” | wh dz. 
If w’(r)h(x) and w(xr)h'(r) — 0 as x — x, choose k = «. Since A(x) > 0 
for xz > 7, h’(r) > 0, then n > v — 1 implies that for both members of (14) 
to have the same sign it is necessary that gt) < olen, Where ptm, is the 
largest zero of w(x, n). 

But if either w’(x)h(x) or w(xr)h'(x) does not approach zero as r — ~«, then 
since h(x) and h’(r) — Oas a — x, w(r)w'(xr) > © asx — ~ and hence for 
some value k of x sufficiently large h’(k) < 0 and w’(k)w(k) > 0. Choosing 
this value for k in (14), and supposing sgn w(x) constant for x > +, then for 
n> v — 1 the sign of the left member is determined as sgn w(r) and the sign 
of the right member as —sgn w(x). This leads to a contradiction. Hence 
whatever the asymptotic behavior of w(x) we have y2,—-1—1 < wmn- 


* This is obtainable from the standard form Di’ + (n + § — }2*)D, = 0 by the substi- 
tution w(x, n) = D,,(2) 2). 
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A similar argument applied to the interval (a, b) = (k, wry-1) leads to the 
inequality w2in < #%y»-1, Whatever the asymptotic behavior of w(x). Hence 
w(z, n) has either v or vy + 1 zeros. 

Consider now the standard solution of the Weber equation, defined by Whit- 
taker’ and discussed by Milne,” for which w(x) = O(e** zr”), p a constant, 
z large and positive. In (11) let (a, b) = (w2mn, ©) = (nm, ®). 


(15) —w’(r)h(m1) = 2(v — n) [ wh der. 


Since w’(7,)w(x) > O for z > »2mn, then vy > n implies that for both members 
of (15) to have the same sign it is necessary that om. < #2», and hence that 
w(x, n) has exactly v zeros. Moreover, since the zeros of w(z, n) are separated 
by the zeros of A(z, v), these results imply wtin < #@%y . 

For the solution of the Weber equation for which w(x) and w’(r) — 0 as 
x — —~© a similar argument leads to the inequality 421, < 2m. , and hence 
to the conclusion that this solution has exactly v zeros, and that yt, < wma." 

Since the conclusions of the two preceding paragraphs cannot hold simultane- 
ously, it is clear that a solution of (9) for non-integral values of n cannot con- 
verge to zero at both + and — ~. 

Summing up our results, and recalling again Sturm’s oscillation theorem, 
we have: 

1. Milne’s properties of the zeros {xin} of the standard solution of the Weber 
equation are: If v is an integer and vy — 1 < n & », then w(z, n) has v zeros. 

If v is even, 4v zeros are positive and 3v negative. If vis odd, }(v — 1) zeros are pos- 
itive and 3(v + 1) negative or zero (zero only for n = v). Moreover, as n increases 
all zeros of w(x, n) increase. 

2. For the solution for which w(x) and w'(r) ~ 0 as x > —~: If v ts an 
integer and v — 1 <n S&S », then w(x, n) has v zeros. If v is even, } v zeros are 
positive and hv negative. If v is odd, }(v + 1) zeros are positive and }(v — 1) 
negative or zero (zero only for n = v). Moreover, as n increases all zeros of w(x, n) 
decrease. 

3. For any solution of the Weber equation: If v is an integer andy —1 <n Sv», 
then w(x, n) has either v or vy + 1 zeros. If v is even, at least }v zeros are positive 
and 4yv negative. If v is odd, at least }(v — 1) zeros are positive and }(v — 1) | 
negative. | 
These properties of the zeros of Weber functions enable us to extend at once : 
bounds for the zeros of Hermite polynomials to the zeros of any Weber function. 

In particular, for the standard solution of (9), defining v2.1 as — ~, 





(16) AZi-w-1 < wlin < wiv (»—1l<n<»p;t = 1,2, ---,»). 


*E. T. Whittaker, On the functions associated with the parabolic cylinder in harmonic 
analysis, Proc. London Math. Soc., vol. 35 (1903), pp. 417-427. 

10 A. Milne, loc. cit. 

11 Incidentally, this solution is obtained from the standard solution by changing z 
into —z. 


— 
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Thus, Winston’s bounds for the zeros of Hermite polynomials give for »y —1 <n <»v 


2% —~»—2+4 2-6 2'(2i — » + 1) ) 





2iy! <tu< Tapp (UG +8, --- 10 
ae elas 83 7) (= 2,-..,v) sitet 
(17) 
a S. < Jutin| < Hy = 32 (i = 2,---,3(v — 1)) }v odd 
ee et 5G = : - = < wlyo4+yn <0 





Moreover, the standard solution of the Weber equation, like the Hermite 
functions, vanishes together with its derivatives for x — «, and their differ- 
ential equations are identical in form. These being the only properties of 
Hermite functions employed by Zernike in obtaining his asymptotic expression 
for 72, , We may at once write 


(18) uitmm = (2n + 1)? — 1,8557571 (2n + 1)”* — 03443834 (2n + 1) *" 
— 0.168715 (2n + 1)*? — 0.151965 (2n + 1)7** + Of(2n — 1)7”'}. 


3. Laguerre functions and associated Weber functions. In (2) let l(z,a,n) = 
l= 2% te *’L,(2", a), a transformation carrying reals into reals for any positive 
aand z. Then, 


r 


(19) 1” + (4 i‘ie~¢.S= #)(a — DY = 0, 


( a>Q0O; ) 
° — oa l ™ ae 0 2nt+a—) —$r? . 
(19a) 1” + (4n + 2a — xl = (a eS 3) L, (+) (x “Yogi 


We are thus led to consider the equation 
(20) w” + (4n + 2a — 2’w = 0 (a > 0), 


upon which we wish to impose a boundary condition of the form w(+*) = 
O(a‘e**), k > 0. But (20) is the Weber equation with 2n + 1 replaced by 
4n + 2a; and the boundary conditions are those of the standard solution. 
Hence, a solution of (20), satisfying these boundary conditions, exists for 
every n and a; and for n and a@ such that 


4n + 2a — 1 


(21) ye—l< 5 


IIA 


vy, vy a positive integer, 











672 VIVIAN EBERLE SPENCER 


w(x, a, n) has exactly » zeros. The set of functions {w(z, a, n)} so defined for 
given n and @ shall be known as the associated Weber functions corresponding 
to the Laguerre functions {l(z, a, n)}. 

Condition (21) may be rewritten as }(2y — 2a — 1) < n < }(2v — 2a + 1). 
The characteristic numbers giving rise to the Laguerre functions are n = 


0, 3, 1, $,---. The corresponding Laguerre functions have respectively 
0, 1, 2, 3, --- zeros, arranged symmetrically with respect to the origin. Let us 
consider the associated Weber functions w(z, a, n), n = 0, 3, 1, 3,---. If 


N(x) denote the next integer = zx, these Weber functions have respectively 
v(?2 7 '), u(t '), a(t *, --+ zeros. Hence, the number of zeros 
of I(x, a, n) equals the number of zeros of w(z, a, n) for 0 < a S 3, and in 
general for any a > 0 the number of zeros of w(z, a, n) exceeds the number of 
zeros of U(x, a, n) by v(?2 7 '), 

Write (19) and (20) in the form 


~~ 
(22) Il’ —p(z,a,nl=0, plz,a,n) =p =2° — 4n— 2at (ce oe a) 
(23) w”’ — a(x, a, n)w = 0, o(z,a,n) =o =x — 4n — 2a. 
Then for every z and n 
psc (2 Sa & 9), 
(24) 
p>o (0 <a < }, ora > $). 


Hence Sturm’s fundamental theorem gives: 1 oscillates more rapidly than 
w for } < a@ < $, and » oscillates more rapidly than l for 0 < a@ < 3, or 
a>$@=lIfora = }, #). 

Denote the zeros of I(r, a, n) by {fi}, 7 = 1, 2,---, n, and the zeros of 
w(x, a, n) by {6;}, 7 = 1, 2, --- , m, where the zeros in each set are arranged in 
increasing order of magnitude. 

Multiply (19a) by w, (20) by J, subtract and integrate, 

x 


(25) (l'w — lw’)' = (a — s)(a — » | =" dx, 
ik  » 


where k is an arbitrary constant to be properly chosen. Since , w’, l,  — 0 
as x — x (25) becomes 


(26) '(k)wlk) — I(k)w'(k) = —(a — })(a — §) I . dx. 
= 
First, let k = ¢,; (26) reduces to 


(27) U(¢,)w(F.) = —(a — $a — 9) | Me air. 


th a 











> 0 





ise ne. 
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¢, is not a multiple zero of I(x) and U(x) remains positive for zx > ¢£, ; hence 
l'(¢,) > 0. This for 0 < a < } ora > } implies ¢, < 6,,. Next, let k = 6,,, 
by similar reasoning 


(28) L(Sm)w’ (bm) = (a — 3)(a — §) I - dx. 


But w(z) has no multiple zeros and w'(é,,)w(x) > Oforz > 6,. For} <a < # 
this implies 6,, < ¢,. Hence, we have 

TueoreM I. The zeros {¢;} of the Laguerre functions U(x, a, n) and the zeros 
{6;} of the associated Weber functions w(x, a, n) satisfy the following separation 
relations 


(O<a<}) b6.4<6 <4, fore l,---,m, meven, 
(29) i} <a < }) Siva <b < oi, { 

(a > 3) 6:1 < 5 < Pinw(?9>*) = bm , jt = at! alia n, n odd. 

2 \ 


4. Asymptotic expressions for the zeros of L(x, a). The results of §2 com- 
bined with Theorem I give us a method whereby results established for the 
zeros of Hermite polynomials immediately yield results for the zeros of gen- 
eralized Laguerre polynomials for any a > 0. The zeros of Hermite functions 
are identical with the zeros {,2;,} of the corresponding Hermite polynomials. 
Between any two of these zeros there lies one and only one zero of the set 
|.2im} Of zeros of the standard Weber function w(z, m) for which n = N(m). 
But the associated Weber functions, w(z, a, n), a particular case of w(x, m), 
lead by Theorem I at once to relations for the zeros {¢;} of the Laguerre func- 
tions I(z, a, n). Results for the zeros {.2;,} of L,(z, @) are then obtained by 
squaring the {f¢;} and replacing n by }n. Moreover, our method permits us to 
extend results for Laguerre polynomials proper, a = 1, to the generalized 
Laguerre polynomials. In fact Winston,” applying Markoff’s theorem, has 
indicated the following method for reducing results for Laguerre polynomials 
(a = 1) to results for Hermite polynomials. 

To indicate the dependence of :zi;, on a@ write .2in = 1Zin(a). Then rela- 
tions (3) and (6) give 


—nFrn1-i.2n = n%in = Vti—nn(h) < Vrti—nn(1) 
G=n+1,n+2,---, 2n) 
—n@ony2-i2nes = wine = V1%i—n—ind) > WV ti-n—1n(1) 
GG =n+2,n+3,---,2n+ 1). 


(30) 


"(. Winston, loc. cit., p. 676. 
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Hence, since y%in > x@inii (1 < t < 2n), 


V 1ii-n-in(1) < —adengiitn = wien < V 1i-nn(1) 
(@=n+1,n+2,---,2n), 


(31) 


where we define the number ,2,.(1) = 0. 
Thus, extending Neumann’s” results for Laguerre polynomials a = 1 to 
generalized Laguerre polynomials the process of the preceding paragraph gives 


i—n rs 2¢ —n+1) 
(32) 2(n + 1) < —HTensy1-i,2n = Zien < “(n+ 1) 


(@=n+1,n+2,---, Qn). 
If we apply §2 and Theorem I, (32) gives’ 


Gg. . 4(i + 1)? is 
4(n + 1) < LZin < n+ i (0 <a< 27t= 2, 3, ade n), 
G-1* ~  -46+2) a 

(33) 441) SS nT iil oii chin alata 


2 
uae ai +N = +!)) 

G -2 va a a oe 2 
4(n + 1) L+in ™ 
Replacing 2n + 1 by 4n + 2a in the asymptotic expression (18) for u2an, 

we obtain as an asymptotic expression for the largest zero 5m of w(z, a, n) 
bm = (4n + 2a)"* — 1.8557571 (4n + 2a) * — 0.3443834 (4n + 2a)** 

(34) — 0.168715 (4n + 2a)~*? — 0.151965 (4n + 2a)*** 

+ O{(4n + 2a)~"*} = a(a, n). 


Let us now investigate the proximity, asymptotically, of 6, to {». Replace 
n in (20) by n’ < n and w by w, 


(35) w: + (4n’ + 2a — 2°) = 0. 
Multiply (19a) by w: , (35) by 1, subtract, and integrate, 


(a > 3,1 = 2,3, ---,n—1). 


2 oe oe 
(36) (I'w, — lo:)| + 4(n — n’) | luxdx = (a — 4)(a — a | ar, 
k k k 
whence 
’ a 
U(k)wi(k) — U'(k)wi(k) = —4(n — n) | loxdx + (a — 4)(a — a | =: de. 
k s = 
18 E. R. Neumann, Beitrdge zur Kenntnis der Laguerreschen Polynome, Jahresbericht der 
Deutschen Math. Vereinigung, vol. 30 (1921), pp. 15-35. 


4 C, Winston, loc. cit., p. 675, has obtained results essentially equivalent to these by a 
geometric argument similar to Neumann’s. 























n). 


n), 


n). 


lace 








Seenuuneaiienasant ats 











ZEROS OF GENERALIZED LAGUERRE POLYNOMIALS 675 
Proceeding as in the previous section, let k = 6,,, the largest zero of w . 
87) Wow)wilbn) = Ain — 0) [nde + (@— Dla 9) [Mae 
Consider (37) for0 < a < }ora > }. Assume U(x) > Oforz > 6,-. Since 
w1(Sm’)w3() > O for x > dm, (37) implies 


4(n — n’) [ laxdz < (a — 4)(a — 9) [ lo dz 
bm’ 8m’ bd 
1 — 2 
i” oie — §) [ lw, dz, 
a Ben’ 


(a mii 3) (a -%) ; 





4(in—n')< 2 
This inequality will be contradicted if 
, (a — 3)(a@ — 3) 
38 <n— _——-. 
(38) n n 482, 


Hence, for n’ satisfying (38), 6.. < ¢,. Combining this result with (34), 

we have 

(39) a(a, n’) Sf S ala, n) (0<a< }ora > $). 
Similarly, replace n in (19a) by n’ < n. Then reasoning analogous to that 

used in deriving (39) gives 

| (@ — 3)(a — §)| 


40 tn <6, for n’<n— 
sia 46°, 


(3 <a < §). 


Hence, for n’ satisfying (40) 
(41) a(a, n’) S Sw S ala, n) (3 <a@ < §). 


But the first term of (34) which will be affected by replacing n by n + O(n™) 
is the term in n*?, hence (33), (34), (39) and (41) lead to 

THEOREM II. If {:2in} denote the zeros of the generalized Laguerre polynomial 
L,(x, a), arranged in increasing order of magnitude, then for any a > 0 bounds for 


these zeros are given in (33), and also for any a > 0 

inn = 4n + 2a — 3.7115142 (4n + 2a)"*® + 2.7550676 (4n + 2a)™* 

(42) ad 
+ O(n). 


UNIVERSITY OF PENNSYLVANIA. 








REMARKS ON THE PROBLEM OF PLATEAU 


By E. F. BecKENBACH 


1. Introduction. We shall consider the problem of Plateau in the following 
form. 

PROBLEM OF PLATEAU. Given a Jordan curve T in xyz-space, determine 
functions x(u, v), y(u, v), 2(u, v) which are continuous for u’ + v° < 1, are har- 
monic and satisfy E = G, F = 0, where 


E=r.t+yita, Fenntyptar, Gaert+yt+a, 


foru’ + v < 1, and map vw + vw = 1 ina topological way on YT. 

Any set of functions satisfying the above conditions are coérdinate functions 
of a minimal surface bounded by IT and given in isothermic representation. 

The following theorems have been proved. 

THEOREM 1. If T bounds some surface, of the type of the circular disc, with a 
finite area, then the problem of Plateau is solvable for T. 

THEOREM 2. The problem of Plateau is solvable for an arbitrary Jordan curve T. 

Theorem 1 has been proved separately and at about the same time by J. 
Douglas and T. Radé.' Subsequent proofs have been given by E. J. McShane’ 
and R. Courant.’ Theorem 2 has been proved by J. Douglas (loc. cit.), and 
later, by means of a different method but the same lemmas, by T. Radé.* In 
what follows, we consider alternative proofs of this latter theorem. 

In proving Theorem 2, Douglas, assuming Theorem 1, first uses a limiting 
process to establish the existence of mapping functions. He then completes 
the proof by using the following two lemmas to show that the functions thus 
obtained map u’ + v° = 1 topologically on I. 

Lemma 1. Let x(u, v), y(u, v), z(u, v) be harmonic and satisfy E = G, F = 0 
foru’+v <1. Suppose x(u, v), y(u, v), 2(u, v) remain continuous on an arc o 
of u’ + vw = 1, and x(u, v) = const. = x, y(u, v) = const. = yo, z(u, v) = 
const. = z ono. Then x(u, v) = %, y(u, v) = yo, z(u,v) = a. 

Dovuauas’ Lemma. Let the integrable functions iy), nly), ¢(¢), substituted 


Received July 12, 1937. 

1 Their results are summed up in the following papers: J. Douglas, Solution of the 
problem of Plateau, Transactions of the American Mathematical Society, vol. 33 (1931), 
pp. 263-321; T. Rad6, The problem of the least area and the problem of Plateau, Mathematische 
Zeitschrift, vol. 32 (1930), pp. 763-796. 

2E. J. McShane, Parametrization of saddle surfaces, with application to the problem of 
Plateau, Transactions of the American Mathematical Society, vol. 35 (1933), pp. 716-733. 

3 R. Courant, On the problem of Plateau, Proceedings of the National Academy of Sci- 
ences, U. 8. A., vol. 22 (1936), pp. 367-372. 

4 An iterative process in the problem of Plateau, Transactions of the American Math- 
ematical Society, vol. 35 (1933), pp. 869-887. 
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in the Poisson integral formula, determine the (harmonic) coérdinate functions of a 
minimal surface in isothermic representation. Let further &(¢), nly), f(¢) ap- 
proach definite limit values —(x), n_(3), ¢-(r) and &,(), x(x), £4() according 
as ¢ — m in clockwise and counterclockwise senses, respectively. Then 


E(r) = (nr), 9 o-(r) = n4(7), FC) = £4 (x). 


We have the following generalization’ of a theorem of Lindeléf.* 

LemMa 2. Let x(u, v), y(u, v), z(u, v) be harmonic and bounded and satisfy 
E = G,F = Ofor0 < arc tan (v/u) < a,0 < w+ <r. Suppose x(u, v), 
y(u, v), z2(u, v) remain continuous on the rayO < u < ro ,v = 0, and r(u, 0) > x, 
y(u, 0) > yo, 2(u, 0) —~ m% asu—- +0. Then in every sector 


v 2 2 2 
0 < are tan- <a—dao, u+uv<7, where « > 0, 
u 


we have x(u, v) > x , y(u, v) > Yo , z(u, v) — Zz as (u, v) — (0, 0) in any manner. 

Since Douglas’ lemma is a direct consequence’ of Lemma 2, it follows, as Radé 
has remarked,” that Theorem 2 is a consequence of Theorem 1 and Lemmas 1 
and 2. 

We shall call attention to two pairs of proofs, most of which have previously 
been given, of Lemmas | and 2, and then, reviewing the limiting process, shall 
obtain a proof of Theorem 2 from Theorem 1 and Lemmas | and 2. 

1. Proof of Lemmas 1 and 2 by means of subharmonic functions. A lemma 
which allows the immediate application of the Principle of the Maximum to 
minimal surfaces is the following.” 

A necessary and sufficient condition that the continuous functions x(u, »), 
y(u, v), 2(u, v) be harmonic functions satisfying E = G, F = 0, is that 
(2 — a)? + (y — b)* + (2 — c)}' be of class” PL for arbitrary choice of the real 
constants a, b, c. 

By means of the above lemma, Beckenbach and Radé give brief proofs of 
Lemmas | and 2, strictly analogous to proofs by means of the Principle of the 
Maximum of corresponding theorems concerning analytic functions of a complex 
variable. Actually, they prove Lemma 1 under the additional restriction that 
r(u, v), y(u, v), z(u, v) be bounded. But this restriction may be removed as 
follows. If x(u, v), y(u, v), z(u, v) satisfy the assumptions of Lemma 1, then 
in an arbitrary Jordan region R bounded by o + B, where o + B is a Jordan 


5. F. Beckenbach and T. Radé, Subharmonic functions and minimal surfaces, Trans- 
actions of the American Mathematical Society, vol. 35 (1933), pp. 648-661. 

* See Pélya und Szegé, Aufgaben und Lehrsdtze aus der Analysis, Berlin, 1925, vol. I, 
p. 138, problem 277. 

7 E. F. Beckenbach and T. Rado, loc. cit., p. 658. 

*T. Rad6é, On the problem of Plateau, Berlin, 1933, p. 73. 

* E. F. Beckenbach and T. Radé, loc. cit., p. 654. 

” A function p(u, v), defined in a domain D, is said to be of class PL in D provided 
p(u, v) is continuous and 2 0 in D and log p(u, v) is subharmonic in the part of D where 
p(u, v) > 0. 
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curve and every point of B is in u° + v° < 1, the functions z(u, v), y(u, v), 
z(u, v) are bounded. Map a + 6 < 1 conformally on the interior of R by 
means of the analytic function u + iv = f(a + 78); an are o’ of e+p=1 
will correspond to ¢. There are induced functions x = z(u, v) = X(a, 8), ete., 
which satisfy the conditions, with a, 8, o’ replacing u, v, ¢, under which Becken- 
bach and Radé proved Lemma 1, so that X(a@, 8) = x , Y(a, 8) = yo, Z(a, B) = 
2, Whence x(u, v) = 2, y(u, v) = yo, z(u, v) = 2. 


2. Alternative proofs of Lemmas 1 and 2. Radé has proved" Lemma 1 about 
as follows. Since by assumption the harmonic functions z(u, v), y(u, v), z(u, v) 
reduce to constants on o, it follows by the Principle of Symmetry that these 
functions remain analytic on o, and consequently the relations E = G, F = 0 
hold on ¢. Since, for an isothermic map, we have dz’ + dy’ + dz? = 
\(du® + dv”), where \ = E = G, and since dx” + dy’ + dz’ = 0 for (u, v) on o, 
it follows that E = G = 0 on a, and consequently 2, = 2. = Yu = Yo = Zu = 
z, = Oonce. That is, the functions x, — iz, , yu — TY, Zu — ty, Which are 
analytic functions of w = u + iv, vanish on an arc of their domain of regularity 
and therefore vanish identically. Hence, 2, = 2, = Ye = Ww =u =% = 9, 
so that x(u, v), y(u, v), z(u, v) are identically constant. 

We now offer a companion proof, based on the notion of normal families, 
of Lemma 2. 

Three functions x(u, v), y(u, v), z(u, v), harmonic in a domain D, are called a 
triple of conjugate harmonic functions provided they satisfy E = G, F = 0 in D. 
In conformity with analytic function theory, we shall say that a family of such 
triples constitutes a normal family of triples of conjugate harmonic functions in D 
provided that every infinite sequence of triples of the family contains a subse- 
quence of triples which converges uniformly to a triple of conjugate harmonic 
functions, or for which 2° + y’ + 2 converges uniformly to infinity, in every 
closed region in D. It is a well known fact” that a family of functions {h(u, v)}, 
harmonic and uniformly bounded in D, constitutes a normal family of harmonic 
functions in D, and that if 


[h,(u, v)], 2=6,1,2,.---, 
is a convergent sequence of the family, then the sequence 
g’** 
| rae hal o| n=0,1,2,---, 


j and k being fixed, converges uniformly in every closed region in D to the 
corresponding derivative of the limit function. It follows immediately that a 
family of triples of conjugate harmonic functions, uniformly bounded in a 
domain D, constitutes a normal family of triples of conjugate harmonic 
functions. 

1 T. Rad6é, Some remarks on the problem of Plateau, Proceedings of the National Academy 


of Sciences, U. 8. A., vol. 16 (1930), pp. 242-248. 
12 See O. D. Kellogg, Foundations of Potential Theory, Berlin, 1929, Chapter X. 
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Denote” (u, v) = (0, 0) by O, (7, 0) by A, and (7 cos a@, 7 sin a) by B. 
Draw a line through O making an angle a — o > O with OA and cutting the 
are AB at C. Let 0 < 6 < ry. Construct ares with center O, radii 6/2”, 
cutting OA at A, , and OC at C,, n = 0,1, 2,---. Let D, be the domain 


eg 
bounded by AnAnsiCniiC, An. Then, for (u,v) in Do, (u/2", v/2") is in D,,. 
Define 


r,(u,v) = (x, x) Yn(u, v) = (x, +) 2n(u, v) = (x x) 


In Do, rn(u, v), yn(u, v), 2n(u, v) take on the same values that x(u, v), y(u, v), 
z(u, v) take on in D,. Since z(u, v), y(u, v), z(u, v) are bounded, the sequence 


(1) [r,(u, v), yn(u, v), zn(u, v)], n = 0, 1, 2, 


forms a normal family of triples of conjugate harmonic functions in Dy. Then 
there is a subsequence 


[rn,(U, v), Yn (U, v), Zr, (U, v)], k=@,1,2,.-.-, 


which converges uniformly in Dy to a set of conjugate harmonie functions 
F(u, v), 7(u, v), Zu, v). 

Since z(u, v) is continuous on OA, x,,(u, v) is continuous there, and, for 
(u, 0) on the boundary of Dp , 


‘ , . u 
E(u, 0) = lim z,,(u, 0) = lim z{ =, 0) = Xp; 
k—>o0 bow \o* 
similarly, 7(u, 0) = yo, Zu, 0) = 2. Therefore, by Lemma 1, #(u, v) = 2, 
g(u, v) = yo, Zu, v) = %. 

Now the entire sequence (1) must converge to 2, Yo, 20, since otherwise 
there would be a subsequence which converges to a triple of conjugate harmonic 
functions other than x», yo, 20, and the above analysis shows that any con- 
vergent subsequence converges to 2, Yo, 2. Therefore, for (u, v) in Do, 

lim x,(u,v) = 2, lim y,(u,v) = yo, lim z,(u, v) = 20; 

n—-o2o nm —o n--o 
but the values of z,(u, v), yn(u, v), Zn(u, v) in Do are the values of x(u, v), y(u, v), 
z(u, v) in D,, so that x(u, v) — x, y(u, v) > yo, z(u, v) — z as (u, v) > (0, 0) 
in the sector 0 < are tan (v/u) < @ — o. 


3. Proof of Theorem 2. Approximate to I in the sense of Fréchet by a 
sequence T, , n = 0, 1, 2, --- , of simple closed polygons. By Theorem 1, the 
problem of Plateau is solvable for I,, ; further, by means of an adjoined linear 
fractional transformation, the solution can be so normalized that three distinct 
points A, B, C on u’ + v° = 1 are carried into three arbitrary distinct points 


8 The following proof parallels a proof of Montel for analytic functions. See P. Montel, 
Lecons sur les familles normales de fonctions analytiques, Paris, 1927, pp. 188-192 
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A,, B,, C, on T,. Choose three distinct points A*, B*, C* on IT and let 
A, — A*, B, — B*,C, — C*. Let now 

z= 2,(u, v), y = ynl(u, v), z = z,(u, v) 


solve the normalized problem for [,,, and let the corresponding boundary 
functions be 


4 v 
xr = &,(¢), y = n,(¢), z = ¢,(¢), ¢ = are tan = 


° ° ° ° 14 ° 

An immediate generalization” of the fact that a uniformly bounded sequence of 
monotonic functions must contain a convergent subsequence assures us of the 
existence of a subsequence 


(2) > = En.(¢), 7 = nn (¢), .= Tule), k=0, 1,2,---, 
converging everywhere on uv’ + v° = 1 to limit functions 

(3) z= &), y = ne), z = $(9), 

which map u’ + v° = 1 monotonically on T, carrying A, B, C to A*, B*, C* 
respectively. 


Consider the harmonic functions 
(4) x = x(u, v), y = y(u, »), z = 2(u, v), ute <1, 


obtained by substituting the functions (3) in the Poisson integral formula. 
Since the functions (2) are uniformly bounded, we may pass to the limit in the 
corresponding sequences of Poisson integrals, so that the functions 


In,(U, v), Yn.(U, v), Zn,(U, v), k = 0, By 2, Pa 


and their partial derivatives converge in u’ + v° < 1 to the functions (4) and 
their corresponding partial derivatives. Since E,, = G,,, F», = 0, it follows 
therefore that E = G, F = 0. 

That the conjugate harmonic functions (4) give a solution of the problem 
of Plateau for the curve T will be established when we show that the boundary 
functions (3), which map uw’ + » = 1 monotonically on I, actually map 
u’ + v° = 1 topologically on T. Now the functions (3) cannot remain constant 
on an are of wu’ + v° = 1, by Lemma 1 and the three-point condition. And, 
by the monotonic character of the map, the functions (3) have definite one- 
sided limits for each value of ¢; we shall show that these limits are the same 
from both sides at an arbitrary g. Because of the nature of the possible 
discontinuity of &(¢) at ¢ = go, it follows from a well-known property of the 
Poisson integral that x(u, v) approaches a definite limit if (u, v) — (cos gw, 
sin go) along any straight line in u” + v° < 1, this limit being a linear function 
of the angle which the straight line makes with a fixed direction and varying 
from & (go) to &,(¢go). Similar statements hold for y(u, v) and z(u, v). But if 


“4 T. Radé, first footnote, p. 771. 











let 


lem 
ary 
nap 
ant 
nd, 
yne- 
ame 
ible 

the 

$0 » 
tion 
ying 
it if 








REMARKS ON PROBLEM OF PLATEAU 681 


we join two such straight lines by a circular are lying in u’ + v* < 1, we obtain 
a sector for which Lemma 2 applies; consequently, (x, y, z) — a definite 
(x0 , Yo, 2.) Which does not vary with the angle. That is, the linear functions 
mentioned above are constants, whence 


£(y0) = &(¢0), n—(yo) = n+(¢0), S—~(Go) = £4(¢0). 


Therefore, the functions (3) map wv + v = 1 in a one-to-one way on I. 


4. Lemmas | and 2 are essentially theorems im kleinen, and their proofs are 
independent of the dimensionality of the containing Euclidean space. There- 
fore, these lemmas may be used to discuss the behavior on the boundary of 
functions giving isothermic representations of minimal surfaces bounded by 
several Jordan curves in Euclidean n-space. 


Rice INstTirure. 








ANALYTIC FUNCTIONS OF ABSOLUTELY CONVERGENT 
GENERALIZED TRIGONOMETRIC SUMS 


By R. H. CamMEron 


1. Introduction. It has been shown by Wiener’ that a nowhere vanishing 
periodic function with an absolutely convergent Fourier series has a reciprocal 
whose Fourier series also converges absolutely. Lévy” has pointed out that 
this result can be extended from reciprocals to general analytic functions. 
Thus if f(x) is periodic and never zero and has an absolutely convergent Fourier 
series, it follows that F[f(x)] also has an absolutely convergent Fourier series 
provided that F(z) is analytic and single valued whenever z = f(x). One of 
the results of this paper (Theorem I) shows that these results are true in n 
or even No dimensions. This is accomplished by carrying through Wiener’s 
proof with the necessary modifications to take care of dimensionality. 

One might reasonably ask whether this result can be extended from periodic 
to almost periodic functions. A partial answer to this question has been given 
by Bochner,’ who has shown that reciprocals of trigonometric polynomials which 
are bounded away from zero on the real axis have absolutely convergent Fourier 
series. It is shown in the present paper that the theorem is true not only for 
trigonometric polynomials, but also for absolutely convergent infinite trigo- 
nometric sums. No further hypothesis is required; so the exponents are alto- 
gether unrestricted and may be any countable set of real numbers. Moreover 
this result is true not only for reciprocals, but for all analytic functions; and it 
holds in n or even Np» dimensions. Thus the final result of the paper is 

TueoreM II. Let f(x, , re, --- ) be an almost periodic function with an abso- 
lutely convergent Fourier series, and let R be the closure of its set of values. Then 
if F(z) is a function analytic over an open set S containing R, it follows that 
F{f(ai, x2, --- )| is an almost periodic function with an absolutely convergent 
Fourier series. 


Received July 12, 1937. 

1N. Wiener, Tauberian theorems, Ann. of Math., (2), vol. 33 (1932), pp. 1-100; p. 14. 

2 P. Lévy, Sur la convergence absolue des séries de Fourier, C. R. Acad. Sci., Paris, vol. 196 
(1933), pp. 463-464. 

2S. Bochner, Beitrag zur absoluten Konvergenz faslperiodischer Fourierrethen, Jahres- 
bericht der Deutschen Math. Ver., vol. 39 (1930), pp. 52-54. 

* After this paper had been submitted for publication, the author learned that his main 
theorem (without the extension to analytic functions or to more than one dimension) 
has been proved independently by H. R. Pitt. Apparently Pitt’s work was done somewhat 
earlier than the author’s, though it was not submitted for publication until about the time 
the present paper was accepted for publication. It will appear in an early issue of the 
Journal of Mathematies and Physics, Massachusetts Institute of Technology. 
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2. Absolute convergence a local property for periodic functions. Before 
proving Theorem I we shall need to extend to infinitely many variables Wiener’s 
lemma’ that a periodic function has an absolutely convergent Fourier series if 
in the neighborhood of every point it is equal to a function having an absolutely 
convergent Fourier series. Such a generalization naturally depends on the type 
of neighborhoods we use, and the appropriate neighborhoods in this case are 
defined as follows. We consider the space whose points P(x, x2, ---) are 
unrestricted sequences 2, 22, --- Of real numbers. Then corresponding to 
each point P(x, , 22, --- ), each « > 0 and each positive integer n we define the 
(e, n)-neighborhood of P to be the set of all points x}, 23, --- satisfying 


r;— x; < € (mod 27) (j = 1,---,n). 


Jessen® has shown that for such neighborhoods in which all but the first n 
variables are unrestricted the Heine-Borel theorem holds for the whole space. 
This fact enables us to extend Wiener’s proof to infinitely many variables and 
obtain 
Lemma 1. Let f(P) = f(a, 22, ---) be periodic of period 2m in each variable. 
Suppose further it is known that corresponding to each point P’(x;, x2, +--+) 
there exist e» > 0 and np and a function fe (P) = fe(a1, t2, --- ) which equals 
f(P) throughout the (ep , np-)-neighborhood of P’ and has an absolutely convergent 
Fourier series 
w ( agP ) 
felP) = AY” +> AP” exp \7 > vi; 2; . 
: 2 


n=l \ 


Then it follows that f(P) has an absolutely convergent Fourier series 
2 kn 
S(P) = Ao + > A, exp ‘i > pati) 
r= 3= y, 


For by the Heine-Borel theorem there are a finite number of points 
P,, P:,---, P, such that every point P is contained in Np, + Np, + --- + Ne,, 
where Np, is the (Jer; , ne;)-neighborhood of P; ; and we shall show how to fit 
together absolutely convergent Fourier series in these neighborhoods to make 
f(P). For positive values of & < a, let T;:(x) be periodic of period 27 in x, and 
let it be defined by the equation T;(7) = max [1 — | 2 //&, 0] in one period 
—rSszreg. 

Obviously this function consists of equally spaced isosceles peaks of height | 
with horizontal lines of height zero in between. Again, let 


Ts ,m(a1 >» OR, eee ) = T (21) T (22) eee T (tm), 


and note that 7’, m(ai, a, ++: ) vanishes outside of the (£, m)-neighborhood 
of the origin and has a peak (or infinite dimensional edge) of unit height a’ 


5 N. Wiener, loc. cit., p. 10. 
® Jessen, The theory of integration in a space of an infinite number of dimensions, Acta 
Math., vol. 63 (1934), pp. 249-323; p. 256. 
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a = 0, rm = 0,---,%m = 0. Finally, if « = min (e,, --- , e,) and n = 
max (np,,--- , np,), and if \ is an integer so great that 2 < «¢/(2z), let 
Un. ele _ Uy. “spay Te,°° -) - Torx, n(1 - 2 ru, 2+ an 2 run) 
and note that 
24+1-1 
| ORO | = 1, 
w=" "=n 0 
and hence that for all P, ; 
2a+1—} 
(1) KP) = DU yyeee-un(P) $(P). 
M1 n= 


Now if in each term of this sum we replace f(P) by a function which equals 
f(P) except when the coefficient of f(P) is zero, the equation will still be true. 
But such functions can be found with absolutely convergent Fourier series. For 


Q: 2“wy,, 2 “mye, --- . 2 ru, , 0, 0, --- 


is contained in one of the neighborhoods Np, , --- , Ne, , say Ne, ; and it follows 
that the (2-“x, n)-neighborhood of Q is contained in the (ep, , np,)-neighbor- 
hood of P,. Thus for all P 


Care --stn(PIS(P) = On, sun(P)fe,(P); 


and since 7:(x) has an absolutely convergent Fourier series, so do T;(x;, x2, ---) 
and U,,,...,(P) and U,,,....4,(P)f(P). Consequently, it follows from (1) that 
f(P) has an absolutely convergent Fourier series and the lemma holds. 


3. Fourier series of small absolute value sum. Again following the course 
of the Wiener argument, we prove the 

Lemma 2. Let f(x: , 2, --- ) have period 2x in each variable, and let it have an 
absolutely convergent Fourier series, the sum of the absolute values of the coeffi- 
cients other than the constant term being K. Then if F(z) is a function analytic 
inside and on the boundary of the circle |z — f(0, 0, --- )| S$ 2K, it follows that 
F{f(ai, re, ---)| has an absolutely convergent Fourier series. 

For F(z) can be expanded in a power series about z = f(0, 0, --- ), and the 
Fourier series of f(z: , 2, --- ) can be formally substituted for z in this power 
series. The sum of the absolute values of the terms arising from (z — 2)” 
will be less than or equal to (2K)", and hence the whole series with all paren- 
theses removed will be absolutely convergent. 


4. General periodic functions with absolutely convergent Fourier series. We 
are now in a position to prove 
Tueorem |. Let f(r, x2, --- ) be a function of period 2x in each variable, 
x 
and lt the sum >. A, of the coefficients of its Fourter series 
0 


«“ ky 
(2) f(x, Te, °° ‘) = Ay + > A,, exp E oe past | 
j= 


n=l 
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be absolutely convergent. Then if R is the closure of the range of f(x, 22, --- ) 
and the function F(z) is analytic in an open set S containing R, it follows that 
F[ f(a, x2, --- )] also has an absolutely convergent series: 


«© ly 
(3) F[f(ai, x2, ---)] = Bot+ p> B,, exp E > anit 
a= = 
On account of Lemma 1, we need only show that corresponding to each point 
P'(z1, ves ) there exists an ep > 0 and a positive integer np and a func- 
tion gp-(x,, t2,--- ) which has an absolutely convergent Fourier series and 
which equals F[f(z:, x2, --- )] throughout the (ep, np-)-neighborhood of P’. 
For under these circumstances Lemma | establishes the existence and absolute 
convergence of the series (3). And since the function fran, Ye, -++) = 
f(a — hs te — &*? ) satisfies the same hypothesis as f(x, , re, --- ), it 
follows that we need only consider the origin and show that there is a function 
g(r, 22, --- ) which equals F[f(a., x2, --- )] throughout some neighborhood 
of the origin and has an absolutely convergent Fourier series. 
Now for any function g let o(g) denote the sum of the absolute values of the 
Fourier series of the function g, and let f(0, 0, --- ) = 2, so that 


2 kn 
f(x, T2,°° -) —-%= > A.{exp E »» pati | - iS 


7 


Let 6 > 0 be so small that z is in S if | z — 2 | < 6, and let N be so great that 


Thus if 


N f kn 
g(P) = g(a, 22, ---) = DA, yex E > paiti| _ i} 


n=l aie 
and 
2 if kn ) 
MP) = YL Andsexp id posts| =i 
n=N+1 \ i=! 


it follows that f(P) = zo + g(P) + A(P) so that o(h) < {6. Now for0 < & < $x 
N 
define WAP) = IT Vilz,), where 


n=l 
( 0 if |r; 2 2¢ (mod 27), 
; a | 
V(x) = y odie if ¢S/x| S 2 (mod 27), 
s 
i if x| st (mod 27); 


and note by actual computation that Ve(r) (and hence also W;(P)) has an 
absolutely convergent Fourier series whose absolute value sum is a bounded 
function of — Moreover, lim o[V¢(r)(e""” — 1)] = 0; for if Y’ denotes the 


f—-07* 
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sum from n = — x ton = +, omitting 0 and p, we have by actual computa- 
tion for all integers p, 
o[V;(x)(e'”? — 1)] 

_9 | 3— cos p= — cos 2pt 

Or p’ rt 

+>’ 2[sin 3(2n — p)é sin 2pt hes sin (2n — p)é sin pé] 
(n — p)? xt 
2(2np — p) sin 3né sin 3né | 


(n — p)?n? xk 








9 3&2 sin }pé sin 3p& 
2r prt 


IIA 


Ph 3 f2 sin 3(2n — p)t |'-| sin }pé | +3 | sin (Qn — p)é |*-| sin pé | 
\ (n — p)*né 
2|2np — p’|-|sin 3né|-| sin }ng |} 
= a “per 
(n — p)*n? ré 
BE, 9 Zip SpE 2 (Qn — p)é|'-| 4pE| + 2|(2n — phe |*-| pe | 
or Pp" wt \ (n _ p) wt 


WA 
Li) 
| 

fo 


2|2np — p’|-| 3né |-| 3né ls 
+ 9.9 
(n — p)*n*® rt 


IA 


6§ +3 mle 1(2n — p) |} + 2p\2n — p + 2 2Qnp — p |-3) 
r (n — p)*x (n — p)?|n|'mr } 


Now consider W;(P)g(P), and note that 
lim o [W,(P)g(P)] = 0, 


t-0* 
since 


lim o[W,(P)e Pm nti Paz ni (e'Pm7" — 1)] = 0 


s 
holds for any integers p,, and implies 


lim o[W(P) (et Paty 1) a 0. 


Finally, choose « > 0 so small that 
ao (WAP )g(P)| < 46, 
and consider the functions 
FUP) = ey + WAP)g(P) + ROP). 
Since 


g(0, O, --- ) h(O, O, © -- ) 0 

















cal 








ABSOLUTELY CONVERGENT GENERALIZED TRIGONOMETRIC SUMS 687 


the constant term of f(P) is f(0, 0, --- ) = 2; and since 
o[f(P) — 2] = o[W.(P)g(P) + A(P)] < 36, 


Lemma 2 applies to f(P) and shows that go(P) = F[f(P)] has an absolutely 
convergent Fourier series. But by definition W.(P) = 1 throughout the 
(e, N)-neighborhood of the origin, and hence go(P) = F{f(P)] in the neighbor- 
hood, and our proof is complete. 


5. Almost periodic functions. We can now pass to the case of generalized 
Fourier series and prove Theorem I1, which has been stated in the Introduction. 
Let 

bed Pn 
S(ai, %2, +--+) = Ao+ _ A,, exp E _ Ani r| 
n=l1 7=1 
The proof can be based on Theorem I in the following way. Let 6 > 0 be so 
small that if | z: — z| S 6 and ze R,, then z« S; and let R* be the closed 
set consisting of all points whose distance from R is not greater than 6. Let NV 
2 
be so great that - 2 |A,| < 36, and let uw, we, --- , 4, be an integral basis 
n=N+1 
for all \,,; for which n S N andj S p, ; so that 


An, j = Zz. Kn. jo by (n = 1, Ne a N;j = - ae a Pn). 
v=l 
Here the k,,;,, are integers, and no integers k, , --- , k, except 0, --- , 0 make 
D ku, = 0. 
v=l - 
If p is the greatest of p,, --- , py , consider the functions’ 
ACY ia, a Yi° > alec Fists oa Pas > Ewity Ense, pee -) 
‘ { Pa s \ 
= Ay + rk A, expy? 2 > are gee 
n=1 \ gel r= 
and 
gYia, ae? 's Feat — > ae sities! Tas s Ewit, Evie, 5 -) 


= AViry ees) Vues ote 3 Ves sees Vines Ewen, Ewses - ++) + PR Ave™. 
=\+ 


It is clear that g has period 27 in all its arguments, and that 


PNert Pyee 
O{wtas mates <5 matnn <p mates De Resta p oe 
j=l 


3=1 


= f(ti, ts, +++). 


7 Putting in exponents arbitrarily as we have done after the V-th term changes the 
range of the function; while putting them in according to the basis as Bochner did in his 
paper (loc. cit.) and as we have done in the earlier terms would require the use of limit 
periodic funetions if it were carried out for all the terms. The author wishes to thank 
Professor Norbert Wiener for suggesting this combination of the two methods 
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Moreover, the closure of the range of h is the same as the closure of the 
range of 


Pn 


N 
Ap + > A, expt Zz Aaj Zj- 
n=l ?=1 


Thus by Theorem I it follows that F(g) has an absolutely convergent Fourier 
series 


2 Pp * In 
Bo + >. B,, exp E . ? a ee + 2 2, rats. 


n=1 j7=1 v= 


Hence 
x p ° In Pj 

F[f(ar, 22, ---)] = Bo + - B,, exp E > is : Qn. jvMv Lj +2 Ws po ras diet b 
n=l j=1 v=1 j=N+1 v=1 


and Theorem II is proved. 


MASSACHUSETTS INSTITUTE OF TECHNOLOGY. 


























TRANSFORMATIONS ON SEQUENCE SPACES 
By L. W. CoHen AND NELSON DuNFoRD 


Hardy and Littlewood,’ Littlewood’ and others have given certain necessary 
and other sufficient conditions on the matrix a,; in order that the bilinear form 
YLa;;x;x; be bounded for >| 2; |” S 1, =| yi |* S 1. So far as we know no 
conditions on the matrix a;; alone have been given which are necessary as 
well as sufficient for the boundedness of the corresponding bilinear form. In 
this paper we consider among other questions the more precise problem of 
determining the norm of the linear transformation y = Tr on l, to 1, in terms 
of the elements a;; of the matrix representing this transformation. We have 
been successful in the special cases where T is on |, to 1, or ¢ and, less trivially, 
on l, or ¢ to l,, if a;; = 6. Conditions for the absolute convergence of the 
determinant of (6;; + aj;) representing J + T as well as properties of the 
matrix of minors are also obtained. These last conditions together with neces- 
sary and sufficient conditions for compactness have been given for a Banach 
space with a denumerable basis g,. In such a space each element z is uniquely 
representable as 





2 
z= Le ta en z, = T 22, 
= 

where 7% is a linear functional on the space ® with | 72| < My». In such 
a space the convergence of x™ to x implies the uniform convergence of x7 to x, . 

In view of the well-known theorems on uniform boundedness of sequences of 
linear operations and the known conditions for weak convergence in many 
Banach spaces, it is comparatively trivial to give the form and norm of the 
general linear operation with the range in c, m = 1, , C, M (bounded functions), 
ete. Consequently these cases have been omitted from the discussion of such 
questions. 

TueoreM 1. If and WV are Banach spaces with denumerable bases and Tr = y 
is a linear transformation of ? into V, the transformation is represented by 


(1) cee LD aijz;, 
j=l 


Received June 2, 1937; presented to the American Mathematical Society, December, 
1936, under the titles: L. W. Cohen, Transformations on spaces with denumerable basis; 
Nelson Dunford, Linear transformations of sequences. 

1G. H. Hardy and J. E. Littiewood, Bilinear forms bounded in space [p, q], Quarterly 
Journal of Math., (Oxford), vol. 5 (1934), pp. 241-254. 

2 J. Kb. Littlewood, On bounded bilinear forms in an infinite number of variables, Quarterly 
Journal of Math., (Oxford), vol. 1 (1980), pp. 164-174. 
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where 
V7 
ay = T; Te; . 
The i-th equation is a linear functional L;x on ® such that 
[Li] $|T| My, 
and the series (1) converge uniformly with respect to i. Conversely, if a;; is a 
matrix such that for every x «® there is a y € WV satisfying 


(2) Tiy= > aT; 2, 
j=l 


the transformation is continuous. 
Proof. For each j 


x 


Te, = 0) = Dak, 


t=1 


where y; is the basis of VW. For z 4, let 2°” = ps z;¢;. Then 


j=1 
y= Tz" = > x;y} = Lud aij;Zj, 
i= v= i= 
y=Tr= DL wis, 


and 


' (n) 
yi — Dayz; < My \ly—y"|| S$ My|TI|-\|2-2 || 
j=1 

imply the uniform convergence of the series (1) with respect toi. Since | y;| S$ 

My |T|\-\|x\|,| Lil S My|T|. 
n 

Now, conversely, for fixed i and n the function >> a; T? x is continuous in z 

j=l 

and hence for fixed 7 its limit, i.e., the function 


2 
® 
i= z. aT; z, 


}7=1 
is also continuous in x. Thus 


y= E (Sart) 


i=1 \j=1 
as well as its limit y is a continuous function of z. 

In specializing the general linear transformation Tz = y, one may require 
that it be completely continuous, i.e., that it carry a bounded set into a com- 
pact set. We state the following condition for compactness. 








Wy 


ire 
mn- 
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THEOREM 2. A set X C is compact in ® if and only if X is bounded and 
n 
lim 2j9; = 2 
n j=l 
uniformly for x « X. 


Proof. From a sequence (y”) of points in X a subsequence (x”) may be 
chosen such that, for all 7, 


. $ 
lim T; 2” = a. 
m 


lim > T? 2x"o; = 2” 


n i=l 
uniformly with respect to m and 


lim >> T?2"¢; 


m i=l 


exists for all n. Thus lim x” exists and X is compact. 


Conversely, suppose that X is compact in @. The sequence 


Sn(x) = » LiQi 


of linear transformations is convergent, hence equi-uniformly continuous. For 
any « > 0 there is a 6, > O such that, for all n, 


| \ € ' ' 
| fn(x) || <% \| x || < 6,. 
There are x’, --- , 2" in X such that for x « X and some i = 1, --- , n(e) 


\|a — 2° || < min (5,4). 


There is an N, such that n > N, implies 


, i ii € 
fle’) - Il <§ 
Now for n > N,, x « X and some 7, we have 


[I fa(z) — x || S || fale) — fale’) |] + [I fale") — 2° || + Il 2*-— al <e 


Corotiary 1. Jnl, (1 p < ©) a set X is compact if and only if X ts 
bounded and 


IA 


lim >> | 2; |” = 0 


n =n 


uniformly with respect to (x;) in X. 
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Coro.iary 2. Inc (the space of convergent sequences (x;)) a set X is compact 
if and only if it is bounded and lim x; exists uniformly with respect to (x;) in X. 


5 | 
Coro.uary 3. The general completely continuous linear regular transforma- 
tion of c into c is 


2 
yi = a; lim x; + > Ajj;Xj, 
] j=l 
where 
~w 
lim a; = 1, lim bis |a,;| =0 
i s gud 


and the norm of the transformation is 
f 


sup | a,|+ 2. | ij i. 


j=l 


Thus, while every Toeplitz matrix is continuous, none is completely continuous 
on c toc. We have the formulas 


Yi 


n 


na a 
a; lim y bnjxi + , x Ai;X;, 
j=l 7=1 


nan 
a; Lim 2; + > AijXj, 


j=l 


Yi 


where (b,;) is a Toeplitz matrix and Lim is a generalized limit defined for all 
bounded sequences. These formulas yield completely continuous regular sum- 
mation methods whose domain of definition is the set of all bounded sequences. 

From Theorem 2 we also have 

Tueorem 3. If T is linear and completely continuous on ® to V and W has a 
bounded basis, then 

lim ¥"" = ¥7 = Te, 
n 
uniformly in j. 

It will appear below (Theorems 12, 13) that this condition is sufficient for 
the complete continuity of 7 in the cases where @ = lL, ¥ = l, (1 S p < @), 
Y=&%. 

To formulate sufficient conditions for complete continuity we may use the 
following theorem which is well known in case the transformations involved 
are linear. 

Turorem 4. Jf 7,2 = y is completely continuous ona Banach space S toa 
Banach space S’ for each n, and if for any « > 0 there is an n, such that 


Tz — T,z|| S €|\z|l, (n > n), 


then Tz 1s completely continuous. 
Proof. Let X be « bounded subset of S and y”™ a sequence in TX. There 
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is a sequence of sequences x; in X such that each Tz} is a y”, (27) D (x}*") 
and lim 7,2} exists. Writing r" = x% we have for « > 0 
q 


, ’ k ' , 7 ' ' ’ kit 
\| Tz” — Tz” || = || Tz” — Taz” || + || T.2" — Taz” | 


= ' 


+ || T.2"" — Ts" || < 96M + | Taz” — ie 


i? 


if |}2|} S$ M whenzeX andm>m,. Then for n > n».,k > 0 we have 


\| Tx” — Tx"** || < (2M + le. 


Since S’ is complete, TX is compact. 
In case T is a linear transformation on ® to V, we have 
TueoreM 5. A necessary and sufficient condition for the complete continuity 
of T is that lim | T, | = 9, where T, 2 = y is defined by the matriz 
n 


(n) . 
a;; = 0, i<n, 


IV A 


= aij, t2n, 
associated with the matrix (a;;) of T. 
Proof. The sufficiency follows from the complete continuity of T — T,, 


lim | T — (T — T,)| = 0 and Theorem 4. Conversely, let y = Tz be com- 


pletely continuous and 


~~ 2 
y = 2 Yidi, y” = p> yi. 


If S is the unit sphere in #, then for every « > 0 there is an n, such that for 
alln > n, and reS, 


because of Theorem 2. For any fixed n > n, there is an x(n) e S such that 
pict ) | € 
| na} = ji na&(n 1} + )’ 
so that 


T.| silly" (@)ll+s<e« 


where y(n) = Tx(n). 
With a view to obtaining results on absolutely convergent determinants we 
state certain sufficient conditions in terms of the rows and the columns of the 


2 rp rp . . *,* 
matrix (a,;) = (7; 7¢;) separately and impose the following conditions on ®. 
ao 
» me } 
(a) Ife, then ze’ = a | 7,2 |¢n €® and || x" || = || r|I. 
n=l 


(b) 10 < T8r < Tle’, then || x || Ss || 2’ 


These conditions are satisfied by the spaces J, and cp . 
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THeoreM 6. If T is linear on to V and 
oe 
L= [Lil Wey, 
i=l 


then T is completely continuous. 
Proof. Let T, be defined by the matrix 
a;; = TiT¢;, 1 


= 0, i>n. 


IIA 
IIA 


n, 


Since T,,z lies in a finite-dimensional subspace of ¥, T,, is completely continuous. 
For any « > 0, 


|T2—Taz|| =) 2 ye =|) Oo lyl vi 


i=at+l i=n+l 


ae |l-|) 20 [Lili <ellell, 


i=n+1 


IIA 


ifn > n,.. Thus T is completely continuous by Theorem 4. 


n no qip’ 
Corotuary. If >, a | ai; r’) < «, then y; = 


a 
a;;x; ts completely 
i=l j=1 1 


, 


continuous onl, tol, (p + p’ = pp’). 
TuHeoreM 7. If T is linear on & to V and 





1 & 
» 2 1 Vj | vj 
j=l 
is a linear functional on ® such that for « > 0 
. i. P P : 
2 ly; |l-!a;| <el|laz]| (n < n,), 
j=ntr+l 
then T is completely continuous. 
Proof. Let T, be defined by the matrix 


aij = TiT¢;, l 


lA 


IA 


n, a 





2] 
= 0), y>n. j 


T,, defined on an n-dimensional space, is completely continuous. Now for 
e>0,n >n, andk > 0 


nok nek cA 

* ' * 1 l * | 
DW < DL |z;|-\¥7 Il s Dail liv ll < [lz]. 
=n j=nt pent 


. " vyy (mn) ' = = +. 
Further, 7,2 = Tz” = 7 > 2:9; > ziv7, so that 
j=l i=l 
’ - * if 
Tz —T,2|| = || >, Lipj || Se |{ x]. 


jen 
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C) 2 p’l@ 2 
Corotiary. If >. (= | Qi; ') < ©, theny; = > a;;x; ts completely con- 


j=l \iwl j=l 
tinuous on l, to l,. 

The next stage of restriction on T is made to assure the absolute convergence 
of the determinant of the matrix of J + T. The condition is essentially the 
combination of the conditions of the last two theorems. 

THEOREM 8. The determinant of the matrix (6;; + a;;) defined by I + T on 
® to ® is absolutely convergent if 


lLLea=> | a;; | x; ts a linear functional on 4; 


7= 


2L= ¥ | Ll eve®, | L | «3% 
3. gr = p> || g} || a; is a linear functional on ®, g; = T¢;; 
, = 
4. > | ass =a< om, 
Proof. According to the theorem of von Koch it is enough to show that 
}» | ass | + - i | Qisy iysg > Cine | 


converges. We show first that for all 7, 7, n 


ol 


(1) _ > _ | Git Minis +++ digg] S | Ls |-|]L||"*+|| @ Il. 
a 
For n = 1, 
yy |as,as,;| = Lig; s [Li |-lle7 Il. 
‘1-= 


From the inductive assumption we have 





? t aw a xa 
; >» | ass, Qijig *** Gin4,i | - ye | ass, | : } i | Qinig cat Ain gyi | 
. . Te Pe | i,=1 So°* cinta) 
r < > p rT n—l )) * jjn—h yy 

— 3 fas || La Po{l EII “lhe; | = (| L]| min (| LoL 

‘1 
! a, *, 
S [Le Ll" |e; Ii. 





Putting ¢ = 7 in (1) we have 


2 wo ~ 
pi , p | iy Qigig aan as, 3 | = | Z |i" p> Iles | “|Z; 


deel dyes ssl sel 
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Thus 
Dal + DD lama as) Sat le) D [Li at eel 
i= n=1 ii): n= apes | 


and the theorem is aie 
The matrix of minors A,;; has certain properties similar to those of the 
matrix a;;. 





A;; | 9; €® for each i. 


2 | | Aj; | x; is a linear functional on ® for each j. 


Proof. Let A,- be the minor of a,. (r # c). The matrix of A,,, after a 
finite number of interchanges of rows and of columns has the form 


Sew (air) ¢ yt ?) 


(a.;) | (i + ne 
(j#r) | (i#e,j#?r) 
If P = I(1 + | p |) is the product extended over all circular products 
P = Qi, Qiig ++ Ain (i, 41, +++ , t distinct), 
then 
| Are] & P{!aer| + > 5. , | Geis Garey -++ dil}. 


n=1 i)" 


From (1) of the previous proof 


| Le | +| atl 
A re s P | Ger + ; i ae 
ue. T= |LT 


Since | a,.| < | L.|, || ¢* ||, we have 
ee |! 
|L.| P41 A 
ne a+ le 


Are) & let || PAL + = ; 

1— ||L|| 
forr Ac. But | As| S P for all 7. Hence under the postulates (a) and (b) 
the theorem holds. 

We turn to a consideration of the spaces Ll, and cp in order to determine the 
exact conditions for continuity and complete continuity of the transformation 
and to evaluate the norm in certain cases. 

THeoreM 10. The equations 


(1) y= Dd aijz, 
1 


| Ave | 


IIA 
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define a linear transformation on l, to l, if and only if 
od l/p 
(a) sup (= aij ") < ow, 
i i=1 


and this constant is the norm of the transformation. 
Proof. Let ¢; be the vector with one in the j-th place and zero elsewhere. 
Then 


eo l/p 2 bad lip 
sup (= | aij ’) = sup > inl (X | dij ‘) 


i i=1 z\=1 j=1 i=l 


2 oo o\ lip 
sup (= Dd az; ‘) = |T| 


zi=1 i=l j=l 


fj @ lip 
sup || Tg; || = sup :. aij ‘) ; 
i i 


i=l 


IV 


IV 


THEOREM 11. The equations (1) define a linear transformation on |, to co if 
and only if 


(a’) sup sup | a;;| < », (a’’) lim a;; = 0, 
i i i 


and the constant (a’) is the norm of the transformation. 
Proof. Let ¢; be the unit vector with one in the j-th place. Then lim a,;; = 0 


since Tg; = (aj;) €¢o for each j and 
oo 
sup sup |a;;| = sup |! Ty;|| S$ |T| = sup sup > air; < sup sup | aj; 
i i i z\i=1 i j=l i i 


TueoreM 12. The equations (1) define a completely continuous transforma- 
tion onl, tol, (1 S p < «) if and only if the matrix (a;;) satisfies the conditions 
(a) and 


bal l/p 
(b) lim sup (= | ai; ’) = 0. 


n i i=n 


Proof. The necessity of (a) follows from Theorem 10 and that of (b) from 
Theorem 3. The conditions are sufficient since 


a lip 2 lip x 
(x in") < sup (3 | Ai; ") > Ee 
? in i= 


and Theorem 2 show that bounded sets go into compact sets. 
TueoreM 13. The equations (1) define a completely continuous transformation 
on |, to co if and only if the matrix (a;;) satisfies the conditions (a’) and 


(b’) lim a,; = 0 uniformly in j. 
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Proof. The necessity of these conditions follows from Theorems 11 and 3. 
The conditions are sufficient since 
x 
sup | yi] < sup sup | aj;| Do | x | 
izn i2n j i=1 
and Theorem 2 show that the image of a bounded set is compact. 

The roles of l, and 1, or co) in these theorems may be interchanged if the a,; 
are positive and the norms of the transformations evaluated. The evaluations 
are corollaries of the following theorem in which the postulates (a) and (b) 
on ® used in Theorems 8 and 9 appear in a weaker form as one restriction on ® 
and one on the conjugate space ®. 

THEOREM 14. Let a;; = 0 and & be a space with a basis ¢, such that 

(i) af 7 Lig; converges so does 8 | 2il ges; 

+ s 

(ii) the norm of a point’ (a;) e® is not decreased when the values | a; | increase. 

io) 


Then equations (1) define a linear transformation on ® to l, if and only if >» a;;€® 
t=1 


4 x 
(7.e., if and only if a a; ; x; converges whenever > x;9; converges). Furthermore 


j=l i=l 2 
the constant 


a 
) » aij || _ 
t=1 2 


is the norm of the transformation. 
Proof. Let f(y) be the linear functional on |, defined by 


fly) = D> Yi. 


If equations (1) define a linear transformation y = Tz on # tol, , then fT is a 
linear functional on @ with f7| S| 7 |. But 


L 


{Tx - > D aj; = Dd az). 


tl j~1 j=l 


If (z,) is the unit vector with one in the j-th place, we see that 


(a;) = (x a) e 


inl 


and 














oer 
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In what follows f will stand for an arbitrary linear functional on ],. From a 
well-known theorem (Banach, Théorie des Opérations Linéaires, page 55, 
Theorem 3) we have 

sup |fT| = |T'|. 

\fi= 
Now f is represented by a vector (f;) in the space of bounded sequences and 
\f| = sup |f;|. Also 


fTx =fL aye, = 2X fai)z; = p is oo 


j=l i=l 


so that 


T| =! Sofas 


t=1 
Thus 
«e | ax 
|T | = sup D fiai; _ = sup D fiai; _s Dd ai; . 
(fj=1 | i=l $ = \fsist) i= ¢ i=l ¢ 
x 
Hence we have shown that if y = Tr is linear on @ to 1, , then > a;;«? and 
Pr - i=1 
|T| = > a;;\_. Now conversely if > a;; «?, we have the iterated sum 
i=l > i=l 
oo x 
Dd az; 
j=l t=1 


converging absolutely and hence for every x ¢€® the sequence 


a 

> 
> a; Trek. 
?=1 


By Theorem 1 then the transformation is continuous. 
Coroutvary |. In case the a;; = 0, the equations (1) define a linear trans- 
formation on Ll, (p > 1) tol af and only if 


This constant ts the norm of the transformation. 
CoroLLany 2. In case a,; = 0, the equations (1) define a linear transforma- 
tion on eo to lif and only uf 


Dd ay <@. 

geil 

This constant is the norm of the transformation. 
Using these corollaries, the corollary to Theorem 7, and Theorem 2 we get 
Coroutiary 3. A linear transformation with ay, = O en l, (lL < p< &) 


or ¢y to Lis necessarily completely continuous, 
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Part of this corollary but not the evaluation of the norm given in Corollary 1 
has been proved in more general form by H. R. Pitt... He has shown that any 
linear transformation on l, to l, is necessarily completely continuous if p > q. 
Littlewood’ has given the same result for p = *, q = 1. 

Coro.tiary 4. If T is linear on ey tol, , and a;; 2 0, then the determinant of 
I + T is absolutely convergent. 

In fact, it follows from Corollary 2 that the determinant of (6;; + a@j;) is a 
normal determinant in the sense of von Koch. 

We conclude with the representation of the general completely continuous 
operation on L (the space of functions ¢g(P) summable on (0, 1)) to Ll, 
(lsq< @). 

TuHeoreM 15. The function T is a completely continuous linear operation on L 
to l,, if and only if it is expressible in the form 


1 

(a) Te = [ k(P)¢e(P) dP 
0 

with measurable functions k,(P) satisfying 


x l/q 
(b) ess sup (= | k(P) ') < ow, 
> 


i=l 


(c) lim ess sup (x k(P) *) = 0. 
n P i=n 
The constant in (b) is the norm of the transformation. 
Proof. It is known’ that 7¢ is linear on L to L, if and only if (a) and (b) are 
satisfied and that (b) gives the norm of the transformation. Let 7,,¢ be defined 
as the vector 


1 1 
(0, 0,---, k,(P)¢(P) dP, I knsi(P)e(P) dP, - - a 
0 0 


/ 


Then 


x“ My 
Tag || < ess up (X k(P) ") Il |. 
r in 
Corollary 1 of Theorem 2 then shows that (¢) is sufficient for the complete con- 
tinuity of 7. Conversely, if 7 is completely continuous, we have for every 
« > O an ne) such that for every ge L with) g |) <1 


fw a\\la 


i 
T.¢ ‘>> | k(Pjg(P) dP f <« (n = n(e)). 

‘HK. Pitt, A note on lntinear forms, Journal of the London Math, Soc., vol. 11 (1936), 
pp. 174-150. 

6 Loc. cit 

* See Dunford, Integration and linear operations, Trans. Amer. Math. Soe., vol. 40 (1936), 
p. 456, or BB. Vullich, Sur les opérations linéaires dans Cespace des fonctions sommables, 
Mathematica, vol. 13 (1937), p. 42, Theorem 1 








peste 





re 


= 








poeioe ned 








TRANSFORMATIONS ON SEQUENCE SPACES 


There exists a ¢, in L with || ¢, || = 1 such that 
|T.| S || Taga || + €. 


' 


Thus for n = n(e) 


2 \ Ma 
'T | = ess wit \k(P)!"> < 2c, 
P j 


(i=n 
and so (¢) is also necessary. 
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ON PERFECT METHODS OF SUMMABILITY 
By J. D. Hii 


1. Introduction. In this paper we are concerned exclusively with Toeplitz 
methods of summability in the real domain, and we begin by introducing the 
definitions and notations which we shall employ. Being given a matrix 


A = (a,x) (k,n = 0,1, 2, --- ) and a sequence x = |s,}, we may form the new 
x 

sequence y = A(x) = }{t,} provided each of the series . Ans = t, = A,(x) 
k=0 


is convergent. If y belongs to the space (c) of convergent sequences, we say that 
x is summable by the method A, or simply A-summable, and we write A-lim z = 
lim y. The elass [A] of all A-summable sequences is called the convergence-field 
of A. If for two methods A and B we have the relation [A] C [B], we say 
that B is not weaker than A. A and B are said to be consistent if A-lim x = 
B-lim x whenever these limits exist. The method J defined by the matrix 
(6,,.), where 6,. is Kronecker’s symbol, is called the identical method or the 
identity; obviously [J] = (c). Every method A for which [J] C [A] is called 
convergence-preserving; if, in addition, A is consistent with J, it is said to be 


regular. If the matrix (a,,) is such that a,, = 0 for k > n, A is said to be 
triangular; if, furthermore, a,, # 0 for every n, A is said to be normal. A will 
be called reversible if the equation A(z) = y has exactly one solution x, con- 


vergent or not, for each y in (ce). For triangular methods the notions of re- 
versibility and normality are easily seen to be equivalent. 
For future reference we list here the following conditions which are necessary 
and sufficient for A to be regular: 
(1.1) lim a,, = 0 (k = 0, 1, 2, ---), 
(1.2) lim er 4, 
nh kel 
(1.3) > jaun| SK (n = 0, 1, 2, ---). 
ke 
We shall say’ that A is of type M if the conditions 


(1.4) Dlial<«, Daa. =0 (k = 0, 1,2, ---) 


n=O ” 0 


Received August 16, 1937. The author gratefully acknowledges his indebtedness to 
Professor J. D. Tamarkin, under whose direction this paper was written 

1 Matrices of this type were first introduced by Mazur in connection with normal 
methods; see Kine Anwendung der Theorie der Operationen bei der Untersuchung der Toe- 
plitzschen Lamitierungsverfahren, Studia Mathematica, vol. 2 (1930), pp. 40-50. We shall 
refer to thir paper hereafter as SM 
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always imply 
(1.5) a, = 0 (n = 0,1, 2,--- ). 


Banach’ calls a method perfect if it is simultaneously regular, reversible, and of 
type M. The importance of perfect methods lies in the following theorems. 

THEoreM 1 (Mazur).’ In order that a normal reqular method A be consistent 
with every regular method not weaker than A, it is necessary and sufficient that A 
be of type M. 

Banach has shown that the sufficiency may be extended to methods which 
are not necessarily normal. 

THeorem 2 (Banach).* In order that A be consistent with every regular method 
not weaker than A, it is sufficient that A be perfect. 

The only known examples of perfect methods are the Cesdro and Euler 
methods of all positive orders, a result obtained by Mazur in SM. It is the 
purpose of the present paper to find conditions under which the Nérlund, 
Hausdorff, and weighted-mean methods will be of type M. We start, how- 
ever, with a few theorems of a general nature. 


2. Perfect methods in general. [Let A = (a,,) be a given regular matrix. 
If we regard A(x) as an operation on (c), the regularity insures that its range 
R. will be a subset of (c), and we then have the following characterization of 
perfeet: methods. 

TueoreM 3. In order that a regular and reversible method A be of type M, it is 
necessary and sufficient that R. be dense in (ce). 

Proof. The assertion of the necessity is merely an alternative statement of a 
result due to Banach.’ To establish the sufficiency consider the points y; = 


6,0} of (ec), (@ = 0,1, 2,---). If R. is dense in (c), there exists corresponding 
to each « > O and each 7 = 0, 1, 2, --- a convergent (and hence bounded) 
sequence {si} = 4;, such that | A(@,) — yi) < €, which is equivalent to 

(2.1) A,(x:i) — bn; | < « (n,t = 0,1, 2, --- ). 


We write (2.1) in the form 
(2.2) A, (ri) = 8:5 + eg, Where | eg) < 
and assume that (1.4) holds. Then for each fixed ¢ it is clear from (1.3) and 


a 
the boundedness in k of { sy; } that : @, Ay 8e; 8 Absolutely convergent. Conse- 
Keone’ 


2 Théorie des Opérations Linéaires, p. 90 

3 See SM, p. 48, Satz 7. It may be remarked here that Mazur (see Mathematische 
Zeitschrift, vol. 28 (1928), pp. 604-605) has constructed two normal regular methods, each 
not weaker than the other, which assign different limits to a particular sequence 

* Loc. ecit., p. 95, Théoréme 12 

® Loc. cit., p. 938, Lemme 2. 
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quently, from (1.4) and (2.2) we have 


“x x 20 ( ~*~ \ ~2 
0 = p a A, Ante Sei = 8 \& Anh Sui fe = > A, (aia, 
k=0 n=0 n=0 (k=O n=0 
x x nm 
- > Oni@, + + €, ia — + aii 


x 
Therefore a; < nae i a, , and, « > O being arbitrary, it follows that 


a; = 0( = 0,1, 2,---). This completes the proof. 

On the other hand, if we consider A(x) as an operation defined in the space 
(m) of bounded sequences, its range F,, lies in (m) on account of (1.3), and it is 
readily seen from the foregoing proof that the following theorem holds. 

TueoremM 4. In order that a regular and reversible method A be of type M it ts 
necessary and sufficient that the points y; = {6,:} of (m) be limit points of R,, 
for alli = 0, 1, 2, --- 

THeorem 5. The product AB = C of two triangular perfect methods A and B 
is also a triangular perfect method. 

Proof. If the matrices of A, B, and C are, respectively, (@nx), (b..), and 


(en), then exe = Do aniby if k < n, and c,. = Oif k > n, so that C is triangular. 
i=k 
Moreover, it is obvious that C is normal and regular. Conditions (1.4) applied 


to C give 


(2.3) 7 AnCnk = : > a,Annbi. = > ( cen tns bs =Q@Q (kt =0,1,2,---), 
nak a=k imk ik nme 


where the interchange of summation signs is permitted by the absolute con- 
x 


vergence of . a, a, by. Setting 


n=O 
(2.4) B; = Dd andy, Gi = 0, l, 2, ie -), 
we have 2. Bi. < = and - Biby = 0 fork = 0, 1, 2,---. Since B is of 
re ieh 

type M, this implies 8; = 0 fori = 0,1, 2,---. This in turn, by (2.4) and 
the fact that A is of type M, implies a, = 0 for every n. Thus C is also of 
type M, and the proof is complete. 

THeorem 6. If the product AB C of two triangular convergence-preserving 


methods A and B is of type M, then A must be of type M. 

Proof. Vf A is not of type M, there must exist a sequence {a‘,} satisfying 
(1.4) but not (1.5). Sinee (1.3) is also necessary for preservation of con- 
vergence, the relation (2.3) can then be established for the sequence fas}. 
This contradicts the assumption that C is of type M. 

We shall have occasion later to use the following theorem, the proof of which 
is easily supplied. 

Tueonem 7. If A is normal and B is triangular, then B is not weaker than A 
if and only if BA * is convergence-preserving. 
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3. The Norlund method. The Nérlund method of summation corresponds 
to a triangular matrix whose elements are of the form a,, = pa»-«/P, for 
k = 0,1,---,n;n = 0,1, 2,---, where {px} is a given sequence such that 
P, = po + pi + --- + pa # O for all n. In particular, for n = 0 we have 
po ~ 0, and we may always take po = 1 since p, may be replaced by p;/po 


without affecting the matrix. Then in view of a,, = 1/P, # 0 we see that 
every Noérlund matrix is normal. Furthermore, since > p/P, = 1 
k=0 


(n = 0, 1, 2, --- ), the regularity conditions reduce to (1.1) and (1.3) alone. 
In the special case where p, = 0 for every k, (1.3) is always fulfilled and on 
account Of prz/Px S Pn—«/Pn-~ the conditions become simply lim p,/P, = 0. 


The question naturally arises whether a regular Nérlund matrix is necessarily 
of type M, and the following example shows that this is not the case. 

Example 1. Let p, = 2 and p, = 0 fork = 2; since p, = 0 and p;,/P, = 0 
if k = 2, the corresponding method is regular by the above remark. On the 
other hand, conditions (1.4), but not (1.5), are satisfied if we let ao = 4 and 
a, = (—4)" forn 2 1. 

THEOREM 8. For a regular Nérlund method to be of type M it is sufficient that 
the sequence 


Pr 1 00 .--- O 
| i a PCeererrrr er rere ere e rere (mn = 1, 2,3, ---) 
Pn-i Pn-2 1 
Pn Pn-i P| 


be bounded, 

Proof. We obtain this result as a consequence of Theorem 4 by imposing the 
condition that R,, actually contain the points y;. This is equivalent to requir- 
ing that the solution {s,;} of the system of equations 


(3.1) A(x) = > (pr—«/Prsei = Sni (n = 0, 1, 2, ---) 
k=O 
be bounded in k for each fixed 7 = 0, 1, 2,---. Observing that the first ¢ of 


the s,; are zero, we may write (3.1) in the form 
(3.2) z. Pn kKSigk,i = Pi4n8in0.6 (n = 0, l, 2, -++) 
k=O 

from which we obtain by applying Cramer's rule to the first 7 + 1 of these 
equations 
(3.3) 8i4j0 = (—1) PD; (i,j = 0,1,2, --- ; De = 1), 
and the theorem follows. 

By condition (1.1) the regularity of a Nérlund method implies that 
Pasi/ Pas (Poss P.)/ Pass 1 (P./ Pay) — 0, or that Pa/Payw — 1 
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ze 
asn—«. Thus the power series > Pz" has a radius of convergence equal to 1, 
n=0 


and consequently the radius of convergence of p(z) = > Paz” = (1 — z) > Paz” 
n=0 n=0 
is at least 1. Combining (3.2) and (3.3) gives the relation ps (—1)*p,-+Di = 
k=0 
6no for n = 0, 1, 2, --- , from which we obtain the formula 
(3.4) 1/p(z) = >, (—1)"D. 2", 
n=0 


valid for | z | sufficiently small since p(0) = 1. Hence for each regular Nérlund 
method there exist positive constants A and B such that | D, | < AB" for all n, 
and the condition of Theorem 8 will be satisfied if and only if B can be chosen 
<1. That the latter is by no means a necessary condition is shown by the 
following example. 

Example 2. For each r = 1, 2, 3, --- there exists a perfect Nérlund method 
for which the corresponding D, is precisely O(n"). For let r be chosen arbi- 


° r ° 
trarily and fixed. Define p, as ( for n = 0, 1, ---, r and as zero otherwise. 
n 


We then have p(z) = (1 + z)’ and 1/p(z) = ites »"(" a \e" so that by 


(3.4), D, = (" ane ') = O(n). 


The p, being non-negative and ultimately zero, the regularity is apparent. 
Let us then assume that conditions (1.4) are satisfied. Fork 2 r these reduce to 


r 


(3.5) Lie) => (‘)as.. =0 (k=r,r+1,r+2,---). 


1=0 
The latter implies L,,(a,) = O for allk 2 r. For we have evidently 
L (ax) = Lalor) + Lralarys) = 0, 
L,(an+1) = L,—-1(a@x%41) + L,-1(ax+2) - 0, 


whence L,:(ax) = L,-:(axs2). Since a, — 0 it is clear that {Z,-1(ax+2:)} for 
each fixed k = r is a sequence of equal terms converging to zero. This es- 
tablishes the assertion. 

It follows then by induction that conditions (3.5) imply Lo(ax) = a, = 0 
for allk =r. The first r equations in (1.4) now take a simplified form from 
which it is obvious that a, = 0 for k = 0, 1, ---,r — 1, and the example is 
complete. 

We conclude this section with a few instances in which the criterion of Theorem 
8 applies. 

Mazur’s result for Cesaro summability (C, a > 0) is immediate. For we have 
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s ~ ‘), p(z) = (1 — 2)“, and 1/p(z) = (—"(*)e" thus 


D, = (*) +00 n—> 2%, 
n 

If p, = 1/(n + 1), we obtain the so-called logarithmic Nérlund method. 

In this case 1/p(z) = —z/log (1 — z) = > (—1)"D,2", and it is known that 


n=0 


Pr = 


the coefficients D,, in this expansion are® O(1/(n log’n)). 

Finally, consider the method defined by the sequence p, = 1 + nd for a given 
d>0. Since P, = (n + 1)(nd + 2)/2, the regularity follows at once. More- 
over, this method is definitely stronger than the identity since one may show 
that it assigns the limit } to the divergent sequence {(1 + (—1)")/2}. We 
have here p(z) = (1 + (d — 1)z)/(1 — 2)’ and 


1/p(z) =1—-(1+d)z2+d@ > (1 — d)”2", 


n=2 
whence D, = d'(d — 1)" forn = 2. Thus if 0 < d S 2, this method will 
be perfect; for other values of d the question remains undecided. 
4. The Hausdorff method. Let {u:} be an arbitrarily given sequence and 
consider the matrices S = (sax), T = (tax), where s,. = (—1)* . tn = we Snr. 


Any matrix of the form H = STS is called a Hausdorff matrix. 
is clearly triangular, and if H = (h,x.) we find that 


(4.01) hu=>. (i ~ a (k = 0,1, ---,n;n =0,1,2,---). 
i=k one 


TM 


uch a matrix 


Since’ S* = IJ it is easy to verify that 

(i) The multiplication of Hausdorff matrices is commutative and the result is 
again a Hausdorff matrix. 

(ii) The inverse of a normal Hausdorff matrix is also a (normal) Hausdorff 
matrix. 

The question whether or not every normal regular Hausdorff matrix is of 
type M seems to be quite difficult and still remains open. However, we shall 
later exhibit (see Example 3) a regular Hausdorff matrix which is not of type M. 
The next theorem, although based on an unverified hypothesis, seems of suffi- 
cient interest to warrant its inclusion. 

TureoreM 9. If Ho is normal, convergence-preserving, and not of type M, and 
if H is not weaker than Ho , then H is not of type M. 


® See Tamarkin, Problem 3276, American Mathematical Monthly, vol. 35 (1928), pp. 
497-500; esp. bottom of p. 500. 

? For a particularly simple proof see Henriksson, Uber die Hausdorffschen Limitierungs- 
verfahren, die schwiicher sind als das Abelsche, Mathematische Zeitschrift, vol. 39 (1935), 
pp. 501-510; in particular, p. 502. 
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Proof. By Theorem 7, H; = HH," is convergence-preserving and from (i), 
(ii) we see that H, = Ho'H. Hence H = HyH, and the result follows at once 
from Theorem 6. 

On the other hand, since Cesiro summability is a special case of the Haus- 
dorff, we know that there exist Hausdorff matrices which are of type M. Con- 
cerning such matrices we have the following result. 

THEOREM 10. If H is normal and convergence-preserving, and if Hy is of type M 
and not weaker than H, then H is of type M. 

Proof. As in the previous theorem we have H,; = HoH = HH», where 
H, is convergence-preserving. Then Hy) = HH, and the conclusion follows 
again from Theorem 6. 

From (i) and Theorem 5 we obtain directly the following theorem. 

THEOREM 11. The product of a finite number of perfect Hausdorff methods is 
likewise a perfect Hausdorff method. 

We propose next to establish conditions sufficient for a regular Hausdorff 
matrix to be of type M. In order that a regular method be defined by (4.01), 

1 


it is necessary and sufficient’ that yu; be of the form yu; = u' dq(u) for i = 
0 

0, 1, 2,---, where q(u) is a function of bounded variation on the interval 

U = (0 S wu S 1) which is continuous at the point u = 0 and which satisfies 


the condition q(1) — q(0) = 1. It is understood throughout this section that 
q(u) will always denote such a function. With this expression for yu; , (4.01) 
n 


1 
reduces to hi, = (x) u‘(1 — u)"~* dq(u), and conditions (1.4) become 
0 


(4.02) Dlaj<«, DL] al”) — w"*dg(u) =0 (k =0,1,2,---). 
k 


n=0 n=k JO 
Since 
(4.03) } “ u‘(1 — u)** = 1 (n = 0, 1,2, ---) 
imo \k 


1 
and I dq(u) = q(1) — q(0) = 1, we observe for future reference that the 
0 


following relation is implied by (4.02): 


(4.04) DD Le aahn = Dy (= has a = Da, = 0. 
k=0 n=k n=0 \k=0 n=0 

We proceed now to reduce (4.02) to a more convenient form. From (4.03) 

n 


A n—k r ‘ 
we have 0 S (2) (1 — u)"“ < 1 on U. Consequently, we see that each 


k 


of the series 


(4.05) gu) = >» a(t) u(1 — u)** (k = 0,1, 2, ---) 
n=k 


* See Hausdorff, Summationsmethoden und Momentfolgen, Mathematische Zeitschrift, 
vol. 9 (1921), pp. 74-109, 280-299. 
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converges absolutely and uniformly on U’, and that for every k 


(4.06) |ge(u) | p> lanl, 
(4.07) gi(u) = (—1)*u‘gs” (u) /kI. 


Thus {g.(u)} is a uniformly bounded sequence of functions, continuous on U 
and analytic for |u—1!< 1. Onaccount of the uniform convergence of (4.05) 
we may write the second part of (4.02) in the form 


(4.08) [ gx(u) dq(u) = 0 (k = 0, 1,2, --- ). 


Now for an arbitrarily fixed ¢t on the interval 0 < ¢ < 1 it follows from (4.06) 
that the series G(t, u) = Zz gx(u)(1 — t)* is uniformly convergent on U. Term- 
k=0 


wise integration is therefore permissible, and from (4.08) we obtain 
1 
(4.09) [ G(t, u) dq(u) = 0. 
0 


But by (4.07), for 0 < u S 1, we have G(t, u) 


> g(u(l — t)* = 


pm go’ (u)(tu — u)*/k! = go(tu). By (4.04), however, go(0) = Zz. a, = 0, 
k=0 


n=0 
and by (4.05), g.(0) = 0 for k = 1, 2,3,---. This shows that the relation 
G(t, u) = go(tu) holds also for u = 0. Making this substitution in (4.09) we 
obtain finally 


(4.10) [ go(tu) dq(u) = 0 (0<t< 1), 


and the problem is reduced to finding further conditions on q(u) sufficient to 
insure that (4.10) shall imply go(u) = 0. 

As a first step in this direction we state the following definition. 

ConpiTi0on C. q(u) will be said to satisfy Condition C if there is an index 
r = 0 such that ¢“*”(w) exists and is bounded on U and if g°*"(1) # 0. We 
shall denote by m the smallest index for which this holds. Obviously we then 
have 


(4.11) gq) = 0 (¢ = 0,1,---,m — 1). 
If q(u) satisfies this condition, we may form the sequence q@(u) = q(u), 
qi(u) = ugq;_s(u) for i = 1, 2, --- , m, and one easily sees that 
(J re 
C,,wq” (u) 
(4.12) qu) = 1% ms (¢§ = 1,2, ---,m), 


\d'q(u)/dz", when u = e* 


where the C;; are certain constants. 
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TuHeoREM 12. If q(u) satisfies Condition C and if constants co, ¢,,--- , Cm 
exist such that 


1 
(4.13) [ ugq,(u) + > e:qi(u) | du < | q"*?(1) |, 
0 


i=0 


then q(u) defines a Hausdorff matrix of type M 
Proof. From Condition C, (4.11), and the first half of (4.12), it follows that 


(4.14) qi(1) = bimqg”*" (1), 

(4.15) q: (u) = O(1) on U, 

Setting s = tu for 0 < t < 1 and writing dq(u) = q’(u) du, (4.10) becomes 
t 


(i = 0,1, ---,m). 


[ go(s)q'(s/t) ds = 0. In view of (4.15) for i = 0 we may differentiate this 

0 

integral and obtain [ go(s)q’’(s/t)(s/t?) ds = q'(1)go(t). Recalling (4.14) for 
0 


1 
= (0 and returning to the variable u, we have I go (tu)uq’’(u) du = 0. This 
0 
1 
added to the original expression (4.10) gives I go(tu)qi(u) du = 0. This rela- 
0 
tion is simply (4.10) with q(u) replaced by q:i(u), and (4.15) allows us to repeat 
this process until we have 


1 
(4.16) go(tu)q;(u) du = 0 (¢ = 0,1, ---,m). 
0 
Repeating the process again with 7 = m and using (4.14) for the index m, we 
obtain 
1 
(4.17) | go(tu)ugq,(u) du = qn(1)go(t) = q°"*” ()go(t), 
0 


where, by Condition C, q'"*"(1) # 0. Multiplying equations (4. 16) by the 
corresponding constants ¢; and adding the results to (4.17) give q'°"*? (1)go(t) = 
[ go(tu)(uq,(u) + ye q.(u)) du on the interval 0 < ¢ < 1. Letting go = 
0 


max go(t) | on (0, 1), we obtain the inequality 


1 m ! 
giq (is w [ ugq,(u) + _ ciqi(u) | du. 
0 =O 
Consequently, if (4.13) holds we must have go = 0, and the theorem follows. 
Corotitary |. If q(u) has a bounded second derivative, and if a constant 
ce < 1 exists such that uq’(u) + eq'(u) 2 0 on U, then the corresponding Haus- 
dorff matrix is of type M. 


1 
Proof. Since « < 1 the relation 0 S [ron + cq’(u)|du = 











‘= 
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1 1 
| uq’’(u) du + ef q'(u) du = q’(1) — 1 + « gives q’(1) > 0, and the hy- 
0 0 


potheses of Theorem 12 are therefore satisfied with m = 0, c@ = c. 

From this fact we obtain at once the following result. 

Coro.uary 2. If q(u) has a bounded non-negative second derivative on U, then 
the corresponding Hausdorff matrix is of type M. 

It seems desirable to show as a final corollary that the criterion of Theorem 12, 
together with Theorem 10, enables us to reproduce Mazur’s result for Cesdro 
summability. 

Coro.tuary 3. The Cesdro matrices of all positive orders are of type M. 

Proof. In this case we have q(u) = 1 — (1 — u)*, wherea > 0. We assume 
first that @ is an integer. Then it is obvious that qg(u) satisfies Condition C 


with m = a — 1. Moreover, g(u) = > (-1(2)u, and from the second 
i=l 


part of (4.12), qg.(u) = Zz, (ay (S) for s = 0,1,---,@— 1. Conse- 
i=1 


quently, we have 


uq.(u) + = c.q.(u) = > (—1)' i(*\(E Me. + (i - pi) r 
s=0 i=l 


s=0 


Hence by choosing the c’s to satisfy the equations 


a-l 
(4.18) Dd te. + G — Ii" = 0 (i = 1,2, ---, a) 
s=0 
as it is clear we may do, we see that the integrand in (4.13) vanishes identically, 
and the result for integral orders is established. But since the strength of the 
Cesiro method increases with the index, the general conclusion follows from 
this by Theorem 10. 

One might be led to suspect that the preceding argument could be applied 
to obtain the desired conclusion when q(u) is an arbitrary polynomial. It 
turns out that such is not the case, however, since in general the system of 
equations corresponding to (4.18) is inconsistent. 

THEOREM 13. The function q(u) = u” for p > 0 defines a perfect Hausdorff 
method. 

Proof. The normality and regularity are apparent. Let us then assume that 
(4.08) holds and replace therein g,(u) by its expression given in (4.07). Since 
dq(u) = pu’ du we find on integrating by parts that 


1 1 
0 = (p+ 0 | u‘ gs" (u) pu” ‘du = pgs (1) — [ ug? (u) pu? du. 
0 0 


Thus gi(1) = 0 for k = 0, 1, 2,---. From this the theorem follows. 

It may be remarked that this result for p 2 2 is an immediate consequence 
of Corollary 2, and, for p = 1, of the ensuing Theorem 15. 

The next two theorems deal with certain general classes of monotone func- 
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tions. To facilitate the statement of the first, we introduce the following 
definition. 

ConpiTion D. Any function qg(u) defined as follows will be said to satisfy 
Condition D. For an arbitrarily given v, 0 < v < 1, let U; = (0 S wu < v) 
and U; = (v Su Zl). On U;, let g(u) be monotone increasing with q(0) = 0, 
and suppose that r exists, 0 < r < 1, such that qv — 0) S r/(1 +r). Finally, 
at each point of U:, let q(u) be equal to 1. 

TuHeoreM 14. If q(u) satisfies Condition D, then the corresponding Hausdorff 
matrix is of type M. 

Proof. For each i = 1, 2, 3, --- let 0 = up < ui <--- < Wai =v bea 
mode of subdividing the interval 0 < u S v such that the maximum length 
of the subdivisions tends to zero as i > ~. Let Qi = q(ui) — q(ui_s) for 
$s = 1, 2, see ,mt+ i. 

Assuming that (4.10) holds, we set 


mj+l 


(4.19) Git) = Y& goltud)Q! (0 <t<}), 


and we have then for each ¢, lim G,(t) = 0. By Condition D, 


is 


Qiu =1— qu) = 1/1 +n), 
0< > Qi /Qinu1 = q(ui,)/( — qui.) Sr <1, 


fori = 1, 2,3,---. Consequently, if we let go = max | go(u) | on (0, v), we 
obtain from (4.19) the inequality | go(vt)| S | Gi(t)/Qhisi| + rg. Letting 
i— » gives | go(vt)| S rgofor0 < vt S v. This is not possible unless go = 0. 
Thus go(u) = 0, and the theorem is proved. 

It is evident that no essential change is necessary in the above proof if v < 1 
andU, = (0s usv),U2=( <uS& il). 

TuHeoreM 15. If q(u) is continuous on U and has a derivative for0 <u < 1 
which is non-negative and non-decreasing, then the corresponding Hausdorff matrix 
is of type M. 

Proof. Under the given conditions q(u) is absolutely continuous and (4.10) 
may be written 


(4.20) [ sntnsa' cw du = 0 (Ost 1). 
If we assume that go(u) is not identically zero, it follows that the function 
(4.21) h(u) = . go(t) dt (0 < u < 1) 
* The proof given here parallels an argument ascribed to Ek. J. McShane, which applies 
directly to the function q(u) = (2/r) sin~! u. See Bonnesen und Fenchel, Theorie der 


Konveren Kérper, p. 138. I am indebted to Professor Hans Lewy for calling this to my 
attention. 
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(which may be continued analytically into the circle | wu — 1 | < 1) must assume 
both positive and negative values. For integrating (4.20) with respect to ¢ 
from 0 to v (0 S v S 1) and interchanging the order of integration we obtain 


fi 
(4.22) | — q'(u) du = 0 (Os vl). 


Since q(1) — q(0) = 1, q’(u) cannot be zero almost everywhere and hence the 
assumption that h(u) is of one sign or zero implies that A(u) is zero on some 
set of positive measure. This is a contradiction to (4.21) unless go(u) is iden- 
tically zero. 

Let us denote then by —m the minimum value of the function A(t)/t, which 
from (4.21) is continuous for 0 < ¢t < 1, if the value 0 is assigned at ¢ = 0. 
Suppose that h(t)/to = —m, where 0 < & S 1, and set p(t) = A(t) + mt so 
that p(t) = 0, p(t) = 0. Then for the function 


Fi) = [2 q'(u) du = [= q'(u) du + mt [ vo du 


we get from (4.22), F(t) = mt(q(1) — q(0)) = mt, and thus 
(4.23) F(t) — F(t) = mit — to), m> 0. 


On the other hand, if f& < 1 and & < ¢t S 1 a simple calculation gives 


F@) — Fle) = [ Pe ara o (‘t)) a + i Plt) ora) au. 
0 u to to/t U 


The first integral in view of p(t) 2 0 and the monotone property of q’(u) is less 
than or equal to zero. Furthermore, for &/t S u S 1 it follows from (4.21) that 


p(tu) = p(tu) — p(toc) = A(tu) — A(t) + m(tu — tb) S (max | go | + m)(t — to). 


This shows that the second integral fs o(f — t). These estimates provide a 
contradiction to (4.23) and complete the proof in case fe < 1. A similar argu- 
ment applies when tf = 1. 

We conclude our discussion of the Hausdorff method by showing, as pre- 
viously mentioned, that there exists a regular Hausdorff matrix which is not 
of type M. 

Example 3. The polynomial Q(u) = 16u° — 27u° + 12u defines a regular 
Hausdorff matrix which is not of type M. Since Q(1) — Q(0) = 1, the regu- 
larity is clear. Now let us choose a = 0, a = —1, and a, = 1/(n(n — 1)) 


2 a 


forn 2 2. Then we have | # lan | < ©, g(u) = > a,(1 — u)" = u log u, 


n=) n=O 
and (4.07) gives gi(u) = —u(l + log u), ge(u) u/(k(k — 1)) for k 2 2. 
Consequently, conditions (4.08) reduce simply to the two conditions 
1 


: ; 
| udQ(u) = 0, | (u log u) dQ(u) = 0, and one easily verifies that these are 
0 e 0 


satisfied. Thus the matrix defined by Q(u) is not of type M. Moreover, 


> 


Il 
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since the diagonal elements of this matrix are given by 

han = [ u" dQ(u) = 6(n — 1)°/((n + 1)(n + 2)(n + 3)) (n = 0,1, 2, ---), 
we see that the normality is destroyed by (and only by) the vanishing of hy . 


5. The weighted-mean method. The weighted-mean method is defined by a 


triangular matrix whose elements have the form a,, = px/P, for k = 
0,1, 2,---,n;n = 0, 1, 2,--- , where the sequence {p,} is such that P, = 
Po + prt --» + pn # Ofor all n. If such a matrix is normal, we must have 


Ann = Pn/P, # 0, or p, # O for every n. In this event, conditions (1.4) reduce 


to Dan, P,, = 0(k = 0,1, 2, --- ), which clearly imply a, = 0 (n = 0,1. 2, --- ). 
n=k 


Thus we have established the following theorem. 
THEOREM 16. Every normal weighted-mean matrix is of type M. 


Brown UNIVERSITY AND MICHIGAN STATE COLLEGE. 























QUASI-UNITARY MATRICES 
By JoHN WILLIAMSON 


Introduction. Let /,, be the n-rowed square matrix 
E,, 0 
0 —E,_m : 


where £; is the unit matrix of order j. Then J,, is the normal form of a non- 
singular Hermitian matrix of index m under a non-singular conjunctive trans- 
formation. A matrix A, whose elements are complex numbers, which satisfies 


(1) AI,,A* = I,, 


where A* = A’ is the conjugate transposed of A, will be called a quasi-unitary 
matrix. In particular, if m = n or 0, A is a unitary matrix. A matrix, A, 
which satisfies (1), is a conjunctive automorph of the Hermitian matrix /,, . 
The conjunctive automorphs of a non-singular Hermitian matrix have been 
studied by Loewy.' He has shown how the nature of the elementary divisors 
of A — XE is restricted by the index m of the matrix J,,. In the following 
paper we derive normal forms for quasi-unitary matrices under quasi-unitary 
transformations, and in doing so are inevitably led to Loewy’s results. (See, 
for example, the remark following Theorem 2.) We also determine necessary 
and sufficient conditions for the similarity of two quasi-unitary matrices under 
a quasi-unitary transformation. In particular it is shown that two quasi- 
unitary matrices which are similar are not necessarily similar under a quasi- 
unitary transformation. In §2 the similar problem for real quasi-orthogonal 
matrices is considered, and in §4 an interesting property of the elementary 
divisors of a pencil, whose base is 7,, and a canonical quasi-unitary matrix, is 
deduced. 

As many of the proofs are in essence the same, subject to obvious modifica- 
tions, as those in a previous paper,’ for the sake of brevity they will be omitted. 


Received May 4, 1937; in revised form September 21, 1937. 

1 Alfred Loewy, Allgemeine bilineare Formen konjugirt imagindren Variabeln, Abhand- 
lungen der Kaiserlichen Leopoldinisch-Carolinischen Deutschen Akademie der Natur- 
forscher, vol. 71 (1898), pp. 377-446; Mathematische Annalen, vol. 50, pp. 557-576. The 
second of these papers gives a short account of the results proved in the first. The term 
quasi-unitary was first used by Harold Hilton, Properties of certain homogeneous linear 
substitutions, Annals of Mathematics, (2), vol. 15 (1913), pp. 195-201. 

? John Williamson, On the normal forms of linear canonical transformations in dynamics, 
American Journal of Mathematics, vol. 59 (1937), pp. 599-617. This paper will be referred 
to as W. 
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1. The problem that we first consider, then, is the following. Let A; and A: 
be two matrices, which satisfy 


(2) Ail, A; = Ens (¢ = 1, 2, 3); 


to determine necessary and sufficient conditions that a third matrix A; , satis- 
fying (2), exist and satisfy 43A2A3;' = A,. 

If A; and A: are similar and a matrix Q, to be specified later, is similar to A; , 
then Q is also similar to A2. There accordingly exist two non-singular matrices 
R, and R;, such that 


R:A;:R;" = Q (¢ = 1, 2). 
The matrices 

S; = RilnRi (¢ = 1, 2) 
are Hermitian and are left invariant by Q; that is, they satisfy 

QS;,Q* = 8S; (¢ = 1, 2). 


Accordingly, if Q is any matrix similar to both of the quasi-unitary matrices 
A; and Ag, there is associated with A, a Hermitian matrix S,; and with Ae a 
Hermitian matrix S, , both of which are left invariant by Q. 

The problem under consideration is reduced to a similar but simpler one by 
means of 

THEOREM |. A necessary and sufficient condition that the quasi-unitary matrix 
A, be similar to the quasi-unitary matrix A, under a quasi-unitary transforma- 
tion is that there exist a non-singular matrix H, such that 


HQ = QH, 
and that the two Hermitian matrices associated with A, and Ag satisfy’ 
HS,H* = Se ° 


Since, in the above, Q is any matrix similar to A, , we are at liberty to choose Q 
in a suitable normal form. Then, if S is any Hermitian matrix, which satisfies 
the equation 


(3) QSQ* = S, 
we shall first determine a unique normal form for S under non-singular con- 
junctive transformations by matrices commutative with Q. If HQ = QH 


and HSH* = T, we shall call the transformation by the matrix H an admissible 
transformation and shall write S = T. 

Let the matrix Q, which is a normal form of the quasi-unitary matrix A under 
similarity transformations, be chosen in the diagonal block form 


Q = [Q: , Qe, --- , Qel, 


* For proof, see W, Theorem 1. 
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where no latent root of Q, has absolute value one, each latent root of Q;,j > 1, 
has absolute value one, and, if 7 # j, no latent root of Q; is the same as a latent 
root of Q;. Then, if (8) is satisfied, the matrix S is also a diagonal block 
matrix, 


S = [Si, S2,---, Si] 
and 
(4) Q:S:Qi = 8; (¢ = 1,2,---,k) 


(W, Lemma 2). Since (4) is the same as (3) except for the suffix 7, we need 
only consider two special cases of Q: 

Case 1. No latent root of Q has absolute value one. 

Case 2. Each latent root of Q is equal to p, where p is of absolute value one. 

Case 1. Since Q is similar to (Q*)™’, Q is similar to the diagonal block matrix 
[F, (F*)']. As a consequence of the remark following Theorem 1, we may 
replace Q by this matrix; that is, we may write Q = [F, (F*)™']. With this 
value of Q, the matrix S is of the form 


(r- «) 


where 7' is a square matrix of the same order as F. The transformation of 


matrix 
r = 
(0) 
is admissible and 


0 E 
(5) s~( )-a 
E 0 


Hence we have 

Resutt 1. If no latent root of A has absolute value one, the matrix Q may be 
taken in the form [F, (F*)"]. Then S = G. 

The matrix F is not unique and may be replaced by any matrix similar to it, 
the classical canonical form, for instance. As a consequence of Theorem | we 
therefore have 

THEOREM 2. If A, is a quasi-unitary matrix similar to a second quasi-unitary 
matrix Az and, if no latent root of A, is of absolute value one, then A, is similar 
lo Az under a quasi-unitary transformation. 

The fact that in this case the index of J,, must be one half the order of J,, 
is a known result." 


* Alfred Loewy, loc. cit. 
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Case 2. Since each latent root of Q is equal to p, we may take Q in the 
canonical form 


Om .  Fas***s Feb 


where 
(6) P., = pE; + pU;, 
and E; and Ul’; are respectively the unit matrix and the auxiliary unit matrix 
of order e;. The elementary divisors of A — AE are therefore 

(A — p)” ((=1,2,---,t3q4 2e 2--- De). 
If 
(7) S = (S,.) (r,s = 1,2,.---,98), 


is a partition of S similar to that of Q, it can be shown first that S = T = 

(T2), where T;; is non-singular (W, §4); then that 7’ = [S,, S:, --- , S.J, where 
+ p* » 

(8) P.,S;P., = 8; 


(W, Lemma 3). Equations (8) are of two distinct types: type (1), the matrix 
P., = P is of even order 2m; type (2), the matrix P is of odd order e = 2m + 1. 
Type (1). The reduction, used in W, type 6, shows that 


2 ¥y 0 Xp 
S; = W; = d;X; = d; . 
i i i Xn 0 


where Xj. is a uniquely determined square matrix, all of whose elements are 


° , rt . 5: 
integers, and Xx, = —Xj.. For example,’ if m = 4, 


Xs = 
1 1 0 0 
|-1 oO Oo 0| 
Since W, = W?, d; = a;i, where a; is a real number different from zero. 


The admissible transformation by the scalar matrix EB/+/|a;| shows that 
S; = W,; = ¢iX,, where ¢; = +1. Therefore we have 

Resutr 2. If e; = 2m, S; = €;iX;, where «¢; = +1 and X;, is uniquely 
determined. 

Type (2). The matrix S; = ¢,Y;, 6; = +1, where Y; = (y,.) is a uniquely 
determined matrix, for which 


Yre = 0 (r,s = 1,2,---,m;sy,. = 0,r +8 2 €; + 2), 


®Cf. Turnbull and Aitken, Canonical Matrices, p. 157. 
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(W, type bz). For example, if e; = 5, 


—) 

o 

me 

tojeo 

a S.A! 


Consequently, we have 

Resutt 3. Jf e; = 2m + 1, S; = €;¥;, where ¢; = +1 and Y; is uniquely 
determined. 

We see from Results 2 and 3 that, with each elementary divisor (A — p)‘ 


of A — XE, where |p| = 1, is associated an e, which has the value +1. 
Therefore, if (A — p)° occurs exactly ¢ times among the elementary divisors 
of A — XE, with this elementary divisor is associated a set of ¢ positive and 


negative signs. We may call the number of these positive signs the index of 
the elementary divisor (\ — p)’. We are now able to prove (W, Theorem 4) 

THEOREM 3. Necessary and sufficient conditions that two quasi-unitary ma- 
trices A; and Ag be equivalent under a quasi-unitary transformation are that 

(a) the elementary divisors of A; — XE be the same as those of Ar — XE, and 

(b) the indices of all elementary divisors (\ — p)‘, |p| = 1, be the same for 
both pencils. 

Theorem 3 includes as a special case the known theorem that two unitary 
matrices which are similar are similar under a unitary transformation. For, 
if A is unitary, only elementary divisors of type (2) may occur with e; = 1 
and the corresponding indices must all be one (or zero). 


2. Real quasi-orthogonal matrices. The above arguments are valid in the 
real field, if the complex number 7 is replaced by the two-rowed real matrix 
0 1 
—1 0 
and the complex number p = a + 7b of unit modulus by the real orthogonal 
matrix 
a b 
—b a 
The elementary divisors (A + 1)" of type (1), e; = 2m, now must be considered 


separately. In this case, since the matrix Sy, in (7) is a symmetric matrix of 
even order, all of whose elements are real numbers, Sy is necessarily singular. 
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However, after at most a rearrangement of the rows and columns of S, we may 


suppose that 
™ *) 
Sor Sere 


is non-singular. Then [P, P] may be replaced by [P, (P*)~’] and 


Su *) (; ‘) 
Sn Sn) \E 0 


(W, type a). Accordingly, an elementary divisor (A +1)*" must occur an even 
number of times and no index need be associated with it. We have therefore 
THEOREM 4. Two real quasi-orthogonal matrices A; and A: are similar under 
a real quasi-orthogonal transformation, if and only if 
(a) the elementary divisors of A, — XE are the same as those of Az — XE, and 
(b) the indices associated with each pair of complex elementary divisors (A — p)*, 
(A — p)*, |p| = 1, and with each elementary divisor (\ + 1)**** are the same for 
both pencils. 


3. Normal Forms. In determining possible normal forms for a quasi-unitary 
matrix under quasi-unitary transformations we first reduce the matrices X; 
and Y; of Results 2 and 3 to simpler forms. In so doing we naturally alter the 
matrices P,,; . 

Type (1). Since e; = 2m, we may write 


Pe Rn 
Pom = ’ 
0 Pas 


where f,, is a square matrix of order m, whose only non-zero element is the 
element p in the first column and last row. Then, if 


E. 0 
i= , |» Where ¢; = «, 
0 —ieXo 
0 E,, Pa — ei Lim 
HaiXH* =| _ =G, and HP»H' = 


where L,, is the matrix, whose last row is 


(9) (p, ieee ot % (—1)” 'p), 


all other rows being zero. ‘Therefore, since « = +1, we have 
Resutt 2a. If e; = 2m;, Pen; may be replaced by 


, Pa, eile, 
"Xo ey 


Then Sam; od Gm; . 
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Type (2). Since e; = 2m + 1, we may write 


lo 
Pomsr = ’ 
0 Parga 


where R,, is a matrix of m rows and m + 1 columns, whose only non-zero ele- 
ment is the element p in the last row and first column. The corresponding 
matrix Y in Result 2 is of the form 


(> 0) 
Y= , 
D 0 


where D is a non-singular (m + 1)-rowed matrix and C consists of the first m 


rows of D*. If 
(" 0 ) 
H = ; 
0 «D" 


Pa M,, 
HeY H* = [G,,, (—1)"] and HPomiiH” = ( ee ° 
0 (Pins) , 


where M,, = ¢R,,D. The first m — 1 rows of M,, are therefore zero and the 
last is 


(10) (dep, —3ep, dep, ---, (—1)™ “dep, (—1)"ep). 


On substituting (—1)"e for « we have 
Resutt 3a. If e; = 2m; + 1, P.; may be replaced by 


Pn, (—D™Mn, 
Z., = iris | 
0 (Pn j+1) 
Then S.; = [Gn;, €l. 


If O is the real orthogonal matrix 


E E 
O=23 ; 
-E E 
then 


(11) OGO’ = [E, —E}. 
Further, if Z = [F, (F*)'], where F is the matrix of Result 1, 
Bu By F+(F*)' —F + (F*)" 
(12) OZO' = B= = 43 - 4 9 
Bn By —F + (F*) F + (F*) 
and, if Z has the value given in Result 2a, 
P+(P*)*+eab —-P+(P*) "+ ) 


(13) O0ZO’ = B= i( e ¥ 
—P+(P*%)"-el P+(P*%'- «iL 
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In reducing the matrix [G,,, , «] of Result 3a to diagonal form it is necessary to 
make a further partition of the matrix Z,.,. Accordingly we write 


m r * —1 (P*)* 0 
Po; = F. ( =» fl) 1M; = Nz, (Pm,+1) = -) J? 
y* = (p*) 
where N is an m-rowed square matrix, x a matrix of a single column, and y* 
a matrix of a single row. An easy calculation shows that 


(14) [O, 1][G, [O, 1)’ = [E., —En, 4), 

and that 

(15) [O, 1]Z[O, 1) = B = (B;;) (i,j = 1, 2), 
where 


Bu={P+N+(P*%)"}2", Be =({|-P+(P*)'+N}2", 22>), 


{-P—N + (P*)"}2" (P+(P*)"'-—Nj2", —227 
By = oe ’ By = = . 
( y*2" ( y*2", _ 
If « = —1, the matrix on the right of (14) is [E,, —Enyaj. Ife = +1, a 
simple interchange of rows and the same interchange of columns reduces the 
matrix on the right of (14) to [Z,.1, —E,,J]. Accordingly, if « = 1, there 
exists a real orthogonal matrix 0; such that 


OIG, 10; = [Eau , —Enl, 


and 
(16) 0,20; = B = (B;;) (i, j = 1, 2), 
where 
(P+N+4(P%)"j2", 22? [-P+N4+4(P*) 2" 
By = j ip By = 1 ’ 
y*2 ’ (p*) y*2 


Bn = ({-P—N+(P*%"j2", -22"), Be={P—N+ (P*‘j2". 


Thus each matrix Z; in Results 1, 2a and 3a is similar under a real orthogonal 
transformation to a matrix B,; given by one of the equations (12), (13), (15) 
or (16) and the corresponding matrix S; = [E,, —£,]. Let Bi, Be, ---, By 
be the complete set of matrices B;, described above, obtained from a quasi- 
unitary matrix A, and let 


B, = (B,.:;) (i,7 = 1, 2: 7, = oe Te y 3% 


Then, if 

(17) C = (C,;;) (3,7 = 1, 2), 
where 

(18) Ci; = (Biz, Bei, --> , Basil, 


it follows that A is similar to C and that S = J,,. We have therefore proved 











red 
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THEorEM 5. A quasi-unitary matriz A is similar under a quasi-unitary trans- 
formation to one and essentially only one of the matrices C defined by (17) and (18). 


4. Elementary divisors. We now prove three lemmas. 

Lemma 1. The elementary divisors of [Pn , (P%)~'] — XG» are all linear and 
of the form X — w, where w is of absolute value one. 

Let 


Pr. —\E,, 
\—rxBE. (P2)7| 


Then‘ 
A = |P,(Ps)"* — BE. |. 


But, if a = p/p = &”, it follows from (6) that 
0 a 0 as O] 
0 0 a ... 0 
P,,(P2)* = aii ears tbe wee Waa 
| 0 0 0 sss @ 
\(-1)""a@ (—1)"*e (—1)""*a areata a } 
Therefore, if \ = ye”, 
A = (—1)"a"[p" — w+ — --- + (—1)"), 


= (-1)"a"(u""? + (—1)")/(u + 1). 


The roots of A = 0 are accordingly all distinct and of absolute value one and 
the lemma is proved. 

We have as an immediate 

CoroLuaRy. The matrix F of Result 1 may be so chosen, that the elementary 
divisors of [F, (F*)'] — \G are all linear and of the form } — w, where w is of 
absolute value one. 

Lemma 2. If Z,, is the matrix of Result 2a, the elementary divisors of Z, — Gm 
are all linear and of the form \ — w, where w is of absolute value one. 

If A = |Z, — dG, |, then’ A = | P,.(PS)" + AciLn — NE |. 

Since p is of absolute value 1, p = e“ and (p*)"' = p. Hence on substituting 
for L,, its value given by (9) and on writing \ = (—1)™ ‘eie“uw and f = 1 + u, 
we find that 


u l 0 eee 0 
a ems 0 yh 1 eee 0 
(— 5)" 'f (— 1)" *f (—1)" f a u 4 f 


= ‘x om 4 ms 1 we Ps 2 abe aes + bu 4 1). 
Consequently the roots of A = 0 are all distinet and of absolute value one. 


* J. Williamson, The expansion of determinants of composite order, American Mathe- 
matical Monthly, vol. 40 (1933), p. 67, formula 7. 
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Lemma 3. If Z is the matrix of Result 3a, the elementary divisors of Z — Gn , 
are all linear and of the form \ — w, where w is of absolute value one. 

If p = e’’ and A is the determinant of this pencil, we deduce, as in Lemmas 
1 and 2, that A is equal to a determinant of order m + 1; in fact 


oi? er"? ae 0 0 
0 —r° oe 0 0 | 
aus ure «-- e-8 a | 
(—1)"e* (—1)™'e® ... -e® ec — re! 


where 8 = ¢” — ee”d/2. On writing eA = ye and adding 8 times the last 


row to the last but one, we see that 


om 1 — 0 
0 —_ vikd 0 
(19) Aa gee... ae reas a . 
0 0 os =p Lhe + hy’ | 
(—1)" (-1)™" «+. =] l-» | 
On replacing the last row of the determinant on the right of (18) by 
(—1)"row, + (—1)""row, + --- —rown + (uv + 1)roWn,1, we obtain 
agl 1 shial ll 0 
" 0 —— vw. © 0 
(1 ‘ wd — 2m+1)i6 ‘ | ~ eG (u), 
0 O +s) —w l—by+ ye) 
(—1)” O .--- @O ¥ 


where y = (1 + w’)(1 — w) — (1 — 4u + 4a’) = w(hu — § — ws’). Hence 
o(u) = (—1)"yw™ + 1 — gu + de, 

(—1y* a? — gu + 2) + de’ — fe + 1. 

We now proceed to show that the equation 


(20) o(u) = 0, 


has 2m + 3 distinct roots of absolute value one. If m is odd and yp = e”, 
(19) is equivalent to 

(21) f(t) = cos (2m + 3)t — } cos (2m + 1)t + § cos (2m — I)t = 0. 

Let 


kr 


= =z @.1,2.---,4 3). 
2m + 3 (k 1,2, :am + 3) 








3). 
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If k is odd, cos (2m + 3)t, = —1, while | cos (2m + 1) | < 1 and 
| cos (2m — 1) | < 1. 


Accordingly, when k is odd, f(t,) is negative. Similarly, when k is even, f(tx) 
is positive. Consequently, f(t) has one zero between ¢, and 4; and therefore 
f(t) has at least 2m + 3 distinct zeros between 0 and x. Hence there are at 
least 2m + 3 distinct values of t between 0 and x, such that e*“ is a zero of 
¢@(u). There are therefore 2m + 3 distinct roots of ¢(u) = 0 which have abso- 
lute value one, and, since ¢(u) is of degree 2m + 3, all the roots of ¢(u) = 0 
are distinct and of absolute value one. By a slight modification of the above 
argument we arrive at the same result when m is even. Therefore the roots 
of ¢(u) = 0 are all distinct and have absolute value one. Since the final reduc- 
tion to normal form in §4 was orthogonal, it is a consequence of Lemmas 1, 
2 and 3 that, if C is the matrix defined by (17), the elementary divisors of 
C — XI,, are all linear and of the form \ — w, where w is of absolute value one. 
Accordingly we have proved 

TueoreM 6. Jf A is a quasi-unitary matrix (unitary with respect to In), 
then A is similar under a quasi-unitary transformation to a matrix C, where the 
elementary divisors of C — XI», are all simple and of the form \ — w, where w is of 
absolute value one. 

If A is unitary, that is, if J,, = EH, Theorem 6 remains true when C is replaced 
by A, and we obtain the theorem that the latent roots of a unitary matrix are 
all of absolute value one. That no such simplification is possible for quasi- 
unitary matrices is shown by the following example. 

If F is an arbitrary non-singular matrix, then 


0 F\ /0 (5 i. 0 E 
(F)' o/\e o/\F* of] \e o/ 


and the elementary divisors of 


0 F 0 E 
—r 
(F*)* 0 E 0 


are the elementary divisors of F — XE together with those of (F*)"' — XE. 
If O is the orthogonal matrix which reduces 


0 ‘) 
E 0 
0 F 
O O'=A 
(F*)"* O 


is quasi-unitary, and the elementary divisors of A — XJ,, are stiil the elementary 
. ’ ’ ’ 1 ’ es 

divisors of F — XE and (F*) " — AF. Hence the elementary divisors of A — XJ, 

need not be simple or even of the type (A — w)‘, where w is of absolute value one. 


to J, , the matrix 


Tur Jouns Hopkins UNIversiry. 











STABLE LAWS OF PROBABILITY AND COMPLETELY MONOTONE 
FUNCTIONS 


By S. BocHNER 


In discussing stability of laws of probability other than the Gaussian, P. 
Lévy’ has proved the following two statements about the Fourier transform 


V,(a) = =f e 24g ltl”? dy. 


© 
I. If 0 < p S 1, V,(a) is non-negative for all real values of a. 

II. If 1 <p < «, V,(q@) assumes in — x < p < ~ both positive and negative 
values. 

It is not hard to prove statement II. As for statement I, a simple proof is 
available in case 0 < p < 3, but this proof cannot be extended to cover the case 
k<p<l’ 

In the present note we shall give two new proofs for statement I. They are 
not of the easiest type, perhaps, but they do not distinguish between the two 
cases and they lead to more general classes of functions having non-negative 
Fourier transforms. 

We shall consider the class 8 of positive-definite functions 


S(a) -[ “dV (a), 


which are Fourier transforms of bounded non-negative distributions. They 
have the following properties: 

l. if fi, fe C B, a, 2 0, a 2 O, then afi + af, C BP; 

2. iffi, fe C B, then fife C BP; 

3. if f. C $B and lim f, exists uniformly in every finite interval, then 
lim f. CB. 

First proof. ‘¥-xcluding the trivial case p = 1, we have to prove that 

f(z) = exp {— |x|} 


belongs to B for 0 < p < 1. Since, for these values of p, 


-% 2p 1 
2p a da = 
Lt = Cy 9) Cp - 0, 


Received September 20, 1937. 

1P. Lévy, Calcul des Probatilités, 1925, pp. 252-277. 

24. Pélya, Herleitung des Gaussachen Fehlergesetzes aus einer Funktionalgleichung, Math. 
Zeitechrift, vol. 1% (1923), p. 109 

*P. Lévy, loc. eit., Chapitre I]; 8. Bochner, Vorlesungen tiber Fouriersche Integrale, 
1932, pp. 63-77 
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f,(x) is, on every finite z-interval, the uniform limit of functions of the type 


n 


2 
a, 


exp ~ he — 
+) 


Hence, by properties 3 and 1, it is sufficient to prove that 


exp { — ar = ¢” -exp ¢- ab 
= e+e x} r+ 


belongs to %. But 
Ps bed a r — 
vo - ; b>", 
on ‘2 + a nao n! valine 


Therefore, because of properties 1, 2, 3, our assertion follows from the ele- 
mentary fact that (2° + b*)” belongs to $8, namely, 
1 [°.; 
(7° +b) = — ef el"! da. 
2b J-« 
A general class of functions including f,(7), for which our argument remains 


in foree, are 


fe(x) = exp |—2°¢(\x))}, 


where 


IV 


g(x) = ts — dy(a) = 0. 

ir 
These functions g(r) are a type of completely monotone functions which have 
been recently investigated by D. V. Widder and R. P. Boas.‘ 

Second proof. We shall use the following lemma.’ If a function f(y) is 
completely monotone in 0 S y < , and y(y) is a function vanishing at the 
origin whose derivative ¥/(y) is completely monotone in 0 < y < &, then 
f(¥(y)) is again completely monotone in 0 Ss y < x. Therefore, 


*= 


Swy)) = e “dy, dy(t) = 0. 


An admissible substitution is 


¥(y) = ¥’, 0O<p<1l, 
since 


“y) = py’ ' = p Hem? dt. 
vy rl — p) Jo ' 


*D. V. Widder, The iterated Stieltjes transform, Proc. Nat. Acad. Se., vol. 23 (1987), 
pp. 242-244. 


5 Completely monotone functions of the Laplace operator for torus and sphere, this Journal, 
vol. 3 (1937), pp. 488-5038 
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Putting f(y) = e ”, we obtain 


e'¥l? = / e “ dy(t), 
Jo 


or 
ett | and. 
0 


Obviously the right side is the uniform limit of finite sums of the type 


> bre, 


v=1 


But «*™” belongs to $ and therefore, by properties 1, 2, 3, the function f,(z) 
does also. 
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ON SOME GENERALIZATIONS OF A THEOREM OF A. MARKOFF 
By Ernar Hite, G. Szec6, anp J. D. TAMARKIN 


I. Introduction 


1. The theorems of A. Markoff and of 8. Bernstein concerning the deriva- 
tive of a rational or of a trigonometric polynomial state that if || f || denotes 
the maximum of the absolute value of a rational polynomial f(r) over a finite 
interval (a, b), or of a trigonometric polynomial over its interval of periodicity, 
then for the derivative f’(r) we have 


(1.1) IS’ Il s An? lif || or |i fl] s Aniifil, 
respectively, where n is the degree of f(x) and A is a constant which does not 
depend on n or on f, but only on (6 — a). In fact A = 1 in the case of ra- 


tional polynomials considered on (—1, +1) and also in the case of trigonometric 
polynomials of period 27. These results can be stated in “abstract” form if we 
consider f(z) as an element of the space C of continuous functions and interpret 
\|f || as the “norm” of this element. A natural question arises then whether 
estimates similar to (1.1) hold if f(x) is considered as an element of other func- 
tion spaces with different definition of the norm. The purpose of the present 
note is to answer this question for rational polynomials in the case of the space 
L,, p 2 1, where the norm is defined by 


| 1 “b 3 \/p 
fi ” if {4 / f(x) ash ° 


2. The corresponding problem for trigonometric polynomials was solved in a 
much more general case by Zygmund' by using an important interpolation 
formula of M. Riesz.” According to this formula we have, for an arbitrary 
trigonometric polynomial of degree n and of period 27, 


2n 


n) 
(1.2) Sf’) | = Qe” |fa@+o")|, 
vel 
) 2) ° ° ° 
where p}"’, 6," are certain numbers which do not depend on f(x) and which 
satisfy 


(1.3) p" > 0, a” =H, 


(1.4) o< gq? <d” < ... < gt’ < Be. 


Received October 2, 1937 
1A remark on conjugate functions, Proceedings of the London Math. Soe., (2), vol. 34 
(1932), pp. 392-400, esp. pp. 394-396. 
2 Eine trigonometrische Interpolationsformel und einige Ungleichungen fiir Polynome, 
Jahresbericht der Deutschen Math. Verein., vol. 22 (1914), pp. 354-368, esp. p. 356, (9), (10). 
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Now let ¢(u) be any non-negative, convex, and non-decreasing function of 
u 20. If we use fundamental properties of convex functions, it is readily 
seen from (1.2) that 


[ “olf |/nddz s | LS pose + 0") |) dz 


0 0 nN y=1 


[ " o() f(x) |) de. 


lA 


The result for L, can be derived from this immediately by choosing ¢(u) = uw’, 
whence 


(1.5) IS Ip Sn |iFilp- 


This is a complete analogue of 8. Bernstein’s classical theorem which can be 
derived from (1.5) by allowing p— <. 


3. The corresponding problem for rational polynomials seems to be more 
complicated because no analogue of (1.2) is known in this case. We show, 
however, that A. Markoff’s theorem still can be extended to the space L, by 
proving the following 

THeoremM. Let p = 1 and let f(x) be an arbitrary rational polynomial of 
degree n. Then 


+1 l/p +1 l/p 
(1.6) if f'(z) a} 4 f f(x) *ae} < An’, 
-1 = 


where A is a constant which depends only on p, but not on f(x) or on n. 

For each n there exist polynomials f(x) of degree n such that the left member of 
(1.6) is = Bn’, where B is a constant of the same nature as A. 

A. Markoff’s theorem (with a less precise value of the constant A) is obtained 
from (1.6) by allowing p— «. Another important case, namely, p = 2, was 
treated some time ago by E. Schmidt.’ The treatment of this special case 
given in Part V below is, as we understand, essentially identical with Schmidt’s 
line of argument. In this special case the constant A above can be charac- 
terized in a more precise fashion than in the general case. 


4. Neither customary methods used for the proof of A. Markoff’s original 
theorem‘ (p = «) nor E. Schmidt’s elegant method (p = 2) seems to be appli- 
cable in the general case. In Parts I] and IIL we give two variants of our 
proof of (1.6) (the first of them valid only for p > 1). Both may present 
interest even in the limiting case p = «. In Part IV we show that the “order” 


* Die asympltotische Bestimmung des Maximums des Integrals tiber das Quadrat der Ablei- 
tung eines normierten Polynoms, dessen Grad ins Unendliche wéchst, Sitzungsberichte der 
Preussischen Akademie, 1932, p. 287. This note contains a statement of the result without 
proof 

*See, e.g., G. Pélya and G. Szegé, Aufgaben und Lehrsdtze aus der Analysis, Berlin, 
1925, vol. 2, pp. 91, 287, problem 23 
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n° of the bound in (1.6) is “the best possible’. The determination of the 
precise value of the constant A seems to be a more difficult problem. 


Il. General case. First method 


1. Our first proof of the inequality (1.6) is based on the following two lemmas, 
one of which is due to Gabriel and the other is an extension of a classical theorem 
of S. Bernstein-M. Riesz.” 

Lemma 2.1. Jf T is any convex closed curve in a complex z-plane and C any 
convex curve inside T, and if F(z) is regular inside and on 1, then 


(2.1) firor zi< ¢ [ire "| dz |. 
c r 


Here \ is any number = 0 and G an absolute constant. 

Let C be any simple closed rectifiable Jordan curve in a complex z-plane and 
let 

z=¥(w) =cwtotaw +---+eow”" +---,ce>0, 

be the function which maps conformally the simply connected infinite domain 
exterior to C into the exterior of the unit circle | w| = 1 in the w-plane. Let 
Cr be the image in the z-plane of the circle wj| = R. With this notation 
we have 

LemMMA 2.2. If f(z) is any polynomial of degree n, then 


(2.2) f(z) || dz| = R"”*" [ f(z) |? | dz |, p> 0. 


CR 
Consider the function 
o(w) = w "“f¥(w))(y'(w))"”. 
It is clear that this function is regular for | w, > 1 (including the point at 
infinity) and therefore the integral 


I, = | o(Re™) |’ do ft i oe") |” de 


as R | 1.7 Here o(c”) = lim o(Re"). Now 


kR—1 


| | f(z) |"|dz| = R?* | o(Re”) |" do = R"’''Te, 
CR 0 


/ S(z) |? |dz| = [ o(e") |" d0, 


and (2.2) follows at once. 


Cf. M. Riesz, Uber einen Satz des Herrn Serge Bernstein, Acta Mathematica, vol. 40 
(1916), pp. 337-347. 

*R.M. Gabriel, Concerning integrals of moduli of regular functions along convex curves, 
Proceedings of the London Math. Soc., (2), vol. 39 (1935), pp. 216-231, esp. p. 229 

7 Concerning the monotony property of /x ef. Pélya-Szegé, loc. cit., vol. 1, p. M4 and 
p. 380, problem 310.) The fact that the limit function ( ) exists and belongs to L, over 
(0, 27) is well known and is trivial in the case which will occur in the subsequent discussion 
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Remark. It is clear that the preceding argument can be applied in the case 
where C is a rectifiable Jordan arc. The only change will consist in replacing 


/ by 2 |. Thus we have 


(2.3) I te) dels ae fisted 


2. We now are prepared to give a proof of (1.6) in the case p > 1; this case 
will be assumed to hold throughout the remainder of this part. Let f(z) be ar 
arbitrary polynomial of degree n. By Cauchy’s formula we have 


(2.4) fe« fle) 


9 eee 


here —1 < x S +1 and the contour of integration C, which may depend on z 
will be specified later.> Hdélder’s inequality yields 


[ire "dx S (2r)” [oa ee dz|. 
(2.5) ° oe | 


— “ » | dz| pip’ 
Sry? J dx) J, WSOP ldel ey | Goa wep 


where 1/p + 1/p’ = 1. 
Let R > 1 be arbitrary and let Ex denote the ellipse with foci at +1 and 
semi-axes 


(2.6) a = }(R+ RR), b = 3(R — R'). 


° — 2 ° ° ° 
An elementary discussion’ gives the following expression for the shortest dis- 
tance D = D(z, R) of x from Ex : 


(oa — 2*)', if |z| sa", 


la—|z|, ifa's|lz|s1. 


(2.7) D(z, R) = 


Now choose for C, the circle with the center z and radius D; this circle is in- 
ternally tangent to Ey. Then the last factor in the right member of (2.5) 
does not exceed (2nD'*”’)””” = (2r)””"D”'. As for the first expression in 
braces in (2.5), a successive application of Lemma 2.1 and of the remark fol- 
lowing Lemma 2.2 yields 


+1 
| \f(z) |"|dz! s “| f(z) |?!dz| s aaner' [ f(a) |? dex. 
Cy, ER 1 


* An analogous argument, in the essentially simpler case p = ~, was used by Montel, 
Sur les polynomes d’approximation, Bulletin de la Société Mathématique de France, vol. 
46 (1918), pp. 151-192, esp. pp. 160-161. 

® For b2u2 + a%y? = ath? we have |(u — x)? + et) = f(a tu ar)? + b(1 — x?)}}. 








“ase 
‘ing 


ase 


lis- 


el, 
ol. 
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On substituting these results in (2.5) we have 


+1 rl +1 
(2.8) [ f(z) |’dr Sx cr | f(x) rar. [ | D(x, R)}~? dz. 
= -1 —3 


3. To obtain the most favorable estimate for the right-hand member of 


(2.8) let R | 1,s0 that 


a—1~>}(R - 1)’, b>R-1. 

Then 
+1 l/a , 
[ {| D(x, R)}~? ‘dz = ar | QQ — 2) ’t? dex 
(2.9) if 0 | 
+ 2 | (a — x)” dz, 

l/a 

where 


[a = ryt dy eo Kp+D [va ~ Kept) ay 
P 0 


~ get) (i — er ~w (R oe 1"? 


Mp—1) ~ p-1 


’ 


1 
| (a— x)” ‘dx = p ‘(a — 1) "$1 — (14 L/a)”} 
l/a 


> 2’p "(1 — 2°")(R — 1)”, 
so that finally 


+1 
| D(a, R)} Pp ‘dz = O{(R — 1)°*"}. 
—1 


On putting R = 1 + 1/n we obtain an upper bound for the right member in 
(2.9), which is O}(R — 1)°°?} = O(n’”). On substituting into (2.8) we obtain 
inequality (1.6) in the case p > 1. 

It should be observed that in the case p = 1 the same method yields an 
estimate O(n*log n) for (2.9). This leads to a result much less precise than 
that obtained in Part III by the second method. 


III. General case. Second method 


1. Our second method in turn is based on two lemmas which may present an 
interest in themselves and on the corresponding result for trigonometric poly- 
nomials, which was stated in Part I. 

LemMMA 3.1. If p 2 1 and f(x) is an arbitrary polynomial (# 0) of degree n, 
then 


+1 +1 
(4.1) f(a) |? A — a’) bdr < 2(np + 1)""* (np) a | f(x) |? de. 
a fi —i 
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Lemma 3.2. Under the assumptions of Lemma 3.1 we have’ 


[ f(x) |? dx 
=" 


(4.2) — : 
< (. en :) (np oF p)"”**(np + 1) np 7 i f(z) me Sa gy» dr. 


In what follows we shall use the elementary transformation z = }(w + w ') 
which for w = e” reduces to x = cos 6. We shall also use integrals extended 
over certain circles |w{| = p. We therefore shall write w = pe” and integrate 
with respect to @ over 0 S 6 S 27. Now, to prove Lemma 3.1, let 0 < r < 1. 
Then" 


w—1 w—l 


| w"fi3(w+w')} |” do< | wf (w+ wd} |? dé 


jwl=r jw|=1 


2 +1 
= [ feos 0) "| sin 9} da = | \f(x) "dx = I,. 
0 § 
Since | w’ — 1| 21 — Pr for|w) = r, we have 
| w'fid(w + w')} |?da< 


whence 


+1 Qe 
l f(x)? — a’) de =} | | f (cos 6) |” d@ 
=| ( 


0 1 » “ 1 |p 
=} | iw"'fie(w+w)} \?dd< 3 | w'fis(w+w )} Pd 


jwi=1 jwj=r—! 


r 2; 2np 
aiid | lw"fik(w + w ')} \°d@0< i = Fy. 
— TT 
|wl=r 
Lemma 3.1 follows immediately if we put 1 — r° = (np + 1)". 
2 \p: 
1° The factor , should be replaced by 1 when p = 1. 
p- 
“ To derive this inequality it is perhaps simplest to observe that if Fi(z), ---, Pi(z) 
are any set of regular analytic functions, then 
F(z) = | Fi(2) |"! «++ | Fa(z) 1", pi 2o0,---,m2 09, 


is subharmonic, and to use well-known properties of subharmonic functions. Cf. Radd, 
Subharmonic functions, Ergebnisse der Mathematik, vol. 5, Berlin, 1987: see pn & 2.4. 











I. 


I. 
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A similar argument can be used in proving Lemma 3.2. Indeed, we have 


+1 2e 
if (x) |’'dx = 3 | f (cos 6) |” | sin 6 | dé 
-1 0 


= / lwhthw+w)) =" a< f iwyteo + wy} peo 
, 4 Fo 4 
juw|=r 
2np-—2 r ney 1/ —ly) P 1 = “uw 
=r jw'fit(w+w)} | rl dé 
jw|=r 


<2" -— 7)” / |w"f{a(w + w')} |?) 1 — w* |? do 


jw l=r 


<a — yer fl wf hw + w)} (P11 — wl do 


Jw|=1 
—i —2np—2 2\1-— 
= rer — hry, 


where 
+1 " 
I, = f(x) |" - ry?” dr. 
—3 


Lemma 3.2 follows immediately if we write here 


1 — rr = (p — 1)(np + p)™. 


2. Inequality (1.6) now is readily derived by combining Lemmas 3.1 and 


3.2 with Zygmund’s result (1.5). Indeed, if we put again 


+1 


we have by Lemma 3.1 
‘i |f (cos @) |!" d@ < 4(np + 1)""*"(np) "Ih, 
whence, by (1.5), 
¥ f'(cos 0) sin 6{"d@ < A(np + 1)""" (np) “’n?di. 
On replacing n by (n — 1) in Lemma 3.2 we now have 


+1 9 pl 
| if'(x) |? dx < ( ~ ) (np)""(np — p+ 1) ""*? *.2(np + 1)""* (np) 
1 p- 


= 2’(p — 1)' "(np + 1)""*"(np — p+ Do"? 


"Pl, 


lp 
n’dy. 
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The coefficient of I, is ~ (2ep)’(p — 1)'’n” as n — & for p fixed. This 
proves (1.6). 
On the other hand, if we allow p — for n fixed and observe that 


n+l 

lim 2(p — 1) '*" "(np + :1)"*""?(np — p+ 1)" =2 2 — < en’, 

p--e (n = 1)” 1 
we obtain in the classical problem of A. Markoff the estimate 

If’ || < 2en* || f |l. 
IV. Exactness of the order n° 

1. Let f(z) = PS%°"(x2) be Jacobi’s polynomial in the “ultraspherical” 

case.” Then, asn — ~, 
+1 x 
(4.1) / | f(x) |? dx a n*”*Q7°*? [ lu “Ja(u) |"udu, 
—1 0 


provided that (a + 4)p > 2. Here J.(u) is Bessel’s function of order a. 
The special case p = 1 of this formula can be found in a paper by Szegé.” 

The following line of argument is slightly simpler than the one used there. 
We use the formula of “Mehler’s type’”’ 


(4.2) lim Pe (cos “) = (u/2) “J.(u), 
d 


nx 


° . ° ° ° 14 
which holds uniformly over every finite interval, and the estimate 


(4.3) P‘*” (cos 0=) = 0 * ' O(n"), n'<s0< 72/2. 


2. Let now w be a fixed positive number. Then 


+1 x/2 w/n w/2 
/ P''(z) |? dz = 2 | P'2"” (cos 6) |’ sin dé = 2 | + 2 | ; 
1 0 0 win 


The first term of the last sum according to (4.2) equals 


2 w ‘ Cu p u . : w 
a) . ap apt a , 
P32" { cos sin —du =n” “2” ju “Jaw \"udu. 
nm jo n " 0 


The second term according to (4.3) is 


ow [ a Pen”? 9 da = O(1)n®” a “tY”, 


/n 


Since w can be taken arbitrarily large, this proves (4.1). 


12 ('f. the notation in Pélya-Szegé, loc. cit., vol. 2, pp. 938, 94, 292, 293, problem 98. 

18 Asymplotische Entwicklungen der Jacobischen Polynome, Schriften der Kénigsberger 
Gelehrten Gesellschaft, 1933, pp. 35-112, esp. p. 88. 
(°F. Szegé, loc. cit., pp. 74, 77 








his 


13 
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3. Now we observe that” 


fa) = £ Pa) = (@ + Hn + WPS), 


so that, by (4.1), 
+1 2 
(4.4) | f(x) |? da  (n/2)? net??? gerPer ju Jass(u) |? udu, 
—] | 0 


and the ratio of the integrals in the left members of (4.4) and (4.1) remains 
> Bn’? , as stated in the theorem of Part I, §3. 


\. E. Schmidt’s case p = 2 


1. Without loss of generality we can confine ourselves to real polynomials 
f(z). Indeed, if we write f(z) = g(x) + th(x), where g(x) and h(x) have real 


coefficients, we have 


[ \f(x) ?dx = ti {g(a)}?dx + [ {h(x)}? dz, 


(5.1) +1 +1 +1 
/ \f’(x) Pdr = [ {g’'(x)}°dx + [ {h'(x)}* dz. 


-1 
Let now f(z) = ao + az + ax + --- +a,2",n 22. Then the determina- 
+1 +1 
tion of the maximum of [ (f’(x)}? dx under the condition [ (f(x) }?dx = 1 
all = 


is a characteristic value problem leading to the system of equations 


0 


(5.2) BS [ [ U"(a)\%de — rf. (702) dz | =06=61,8---.a. 


This system is equivalent to the condition that 


(5.3) [ f'(x)q'(x) dx — af, f(x)q(x) dx = 0 


be satisfied by an arbitrary polynomial g(x) of degree n. Integration by parts 
gives 


(5.4) [ f(x) + Af(a)} g(a) dx = f(a) — f'(—Da(—D. 


2. Introducing Legendre polynomials {P,(x)} in the usual notation, we put 
in (5.3) 


(5.5) giv) = & & + DPLWPY), 


'S Szegi, loc. cit., p. 38, (5). 
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where y is a parameter. By using familiar properties of Legendre polynomials, 
we obtain 
6610 +NwW =F LX & + DPW) —f(-1) & & + DP-») 

= 4D {PAY + Pra) + (-D'S(-DEPAY) — Pru}. 
Now it is helpful to write 


(5.7) — fly) = aoPrasly) + Pi (y) + Peay) + asP2(y) + ++. 


This is obviously possible if the real constants ap , a , a2, --- , @, are suitably 
chosen. On substituting (5.7) into (5.6) and comparing the coefficients we get 


hao = 4f’(1) — 8(—1)"f'(—1) = ao Ph a(l) + ee PS) + +, 
(5.8) ra = 3f'(1) + (—D"f'(—D = mP2(1) + a PY(1) + ---, 
Aa, + a2 = 0 (vy = 2,3, ---, n). 


This system is equivalent to (5.2) and readily furnishes the characteristic values 
and functions. Indeed, since ) is real and positive, we have 


(5.9) ax, = (—A) "a, — aeyr = (—A) "em, (» 2 1), 
and 
(5.10) hao = ao | Prsa(l) — XPM) + APPL) — ++ +4, 
5. hen = on {P2(1) — A PO") + APPL") = «- - - }.. 
Thus the set of characteristic values is obtained from the two equations 
Prd) — XT PL a) + XP PLD — +--+ = 0, 
(5.11) pee ines 
P,Q) — X'P,Q) + X° PY) — +++ = 0. 


3. Observe that we have for all v and n 


a (n + 2)! 1 2 


- 4v yy(2v) a ¥ 
(5.12) n'P, (1) =n (n — 2v)! 2%(2v)! ~ 2%(2v)!’ 


while for v fixed 


- * —4y (2¥) 1 
— ee ree © ear 
Let Ano, Ant, --* » Ann denote the characteristic values in decreasing order. If 


we observe (5.12), (5.13), and apply Hurwitz’ theorem to cach of the equations 
(5.11) we readily see that the limit 


(5.14) lim n ‘Anke = Ye (k = 0, 1,2, ---) 


n~-*a 
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exists for each fixed k and that furthermore the set {A,} represents the set of all 
the roots of 


- i ut. 
(5.15) rar 2**(2y)! = cos (2X’) = 0, 
so that 
(5.16) am = 9 (2k +1)° (k = 0, 1, 2, ---)- 


If now M*, denotes the maximum of the ratio of the integrals [ 


f'@) ae 


+1 
and / {f(x)}* dx, we have the final result”® 
—1 


(5.17) lim nM, = limn’*vby = M = 

Remark. It can be shown that the largest characteristic value \,» necessarily 
is a root of the first of the equations (5.11). Indeed let fn4:(A) and f,(A) denote 
the left-hand members of the two equations (5.11), respectively. A simple 
calculation shows that 


(5.18) SNL Fasad) + FaO0]] = (m+ DNOM-GO). 


If fasi(Ano) ¥ 0, we necessarily have fusi(Ano) > 0, fasi(no) > 0, fa(Ano) = 0, 
so that the left-hand member of (5.18) is positive for 4 = Ano, whereas 
fn(Ano) = 0. This is a contradiction. 

In fact it is easy to show that Axo satisfies only the first of the two equations 
(5.11). 


YaLe UNIvEerRsITy, WASHINGTON UNIVERSITY, AND BROWN UNIVERSITY. 


16 There is a slight discrepancy between this result and that of E. Schmidt according 
to which this limit would be 1 instead of «7. 








(n — 1)-DIMENSIONAL CHARACTERISTIC STRIPS OF A FIRST 
ORDER EQUATION AND CAUCHY’S PROBLEM 


By E. W. Tittr 
Consider the first order partial differential equation 
(a) F(z*|z| pa) = 0 (pa = 02/dx") 


in one unknown z and n independent variables x* (n = 3). The purpose of 
the present paper is to generalize to the case of more than two independent 
variables the usual geometrical discussion of Cauchy’s problem showing the 
manifolds for which the problem is indeterminate.’ In doing this we introduce 
the concept of an (n — 1)-dimensional characteristic strip and study its relation 
to the one-dimensional characteristic strips. 


1. We first recall the geometrical approach to the one-dimensional character- 
istic strip. If we assume that the space S"*’ with codrdinates z',---, 2", 
z is Euclidean with rectangular Cartesian coérdinates, the problem of inte- 
grating the equation (a) is that of determining a hypersurface” 


(1.1) = 2(z', Kiba x") 

in S"*’ such that the direction ratios p,: --- 1p,:—1 of the normal to (1.1) 
satisfy the condition (a) at each point P of (1.1). The geometrical configura- 
tion consisting of a point P(z’, -.. , 2", z) and a hyperplane passing through P, 
namely, 

(1.2) Z—z= p.(X* — z*), 


in called an element. In general, the integral elements at P, i.e., those which 
are possible tangent hyperplanes to integral hypersurfaces at P, envelope a 
hypercone 7 with vertex at P. It then follows easily that an integral element 
(1.2) is tangent to the hypercone 7 along the generator given by* 


‘ Pr .# ys” fu 
(1.3) — = +e. se .——, 
Fy, F,, Pal, 

teceived June 16, 1937; developed in part while the author was a National Research 


Fellow and mentioned in a paper presented to the American Mathematical Society, Feb- 
ruary 23, 1935. See Abstract 41-3-103, Bull. Amer. Math. Soc., vol. 41 (1935), p. 182. 

1 See, for example, E. Goursat, Cours d’Analyse Mathématique, Tome II, 1924, Chapter 
22. We do not discuss the regularity requirements on F or the initial manifold. For a 
treatment of this question in the case of two independent variables the reader is referred 
to G. A. Bliss, Princeton Colloquium Lectures, Amer. Math. Soc., 1913, p. 98. 

2 In S**! we shall call an n-dimensional spread, a hypersurface, and an (n — 1)-dimen- 
sional spread, an edge. For linear spreads we shall use the terminology hyperplane and 
plane edge. 

*Cf. Goursat, loc. cit., p. 616. 
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The characteristic curves are the curves traced on an integral hypersurface 
which are tangent at each point P to the generator (1.3) determined by the 
integral element at P. As is well known, these curves are determined by the 
system of ordinary equations 

1 


(1.4) dx -— dx dz dp, dp, 


1 
_ = - = = mMmZeeos— = dv 
F,, F,, Pak, —Fa—Fipm —Fy»—F,p, 
without knowledge of an integral hypersurface. The geometrical configuration 
consisting of a curve together with a strip of elements along it satisfying the 

system (1.4) will be called a one-dimensional characteristic strip. 


2. Now let us turn to the consideration of Cauchy’s problem for equation (a), 
i.e., the problem of passing an integral hypersurface (1.1) through an arbitrary 
edge 


(2.1) x* = E(v', --- ,v"'); z= g(v',--- 0"). 


Generalizing the idea of a characteristic curve, by a characteristic edge we shall 
mean an edge which lies on an integral hypersurface and which has the property 
that its tangent plane edge at each point P contains the generator (1.3) deter- 
mined by the integral element at P. We shall now proceed to find for our 
characteristic edge a system of partial differential equations, which is inde- 
pendent of the integral hypersurface. 

If the plane edge, tangent to the characteristic edge (2.1) at the point P, 
is to contain the generator (1.3) associated with P, then the rank of the matrix 


’ , 
¢ F,, F,, PaF,, 
1 n e 
(2.2) a a 
1 n 
Tri Tri ~n—l 


where the subscripts on the x’s and z denote partial derivatives with respect to 
the v’s, must be less than n.* Since the characteristic edge (2.1) lies on an 
integral hypersurface, the integral element at P must contain the tangent plane 
edge at P, i-e., 


(2.3) & = pels. 


Multiplying each of the first » columns of (2.2) by po and subtracting from the 
last, we have on account of (2.3) that the condition on (2.2) is completely 
equivalent to 


(2.4) F,.Da = 0, 


‘In what follows, when we use the term edge we always imply that the matrix obtained 
by striking out the first row and last column of (2.2) is of maximum rank. Throughout 
Greek letters a, 8, y, --- have the range 1, --- , »; Latin letters a, b, c, --- , the range 
i see 1; and Latin letters 7, j/, k. the range 2, --- , n 1 
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where the D, are the cofactors of F,, in the determinant of the first n columns 
of (2.2). Differentiating the equation (a) with respect to 2* we obtain 


, az 
(2.5) Fy, Pas + F.pa + Fyn = 0 (rx = Sis) 
If we adopt the notation A% for the cofactor of the element zf in the deter- 
minant of the first n columns of (2.2), then we have immediately the relations 


(2.6) F,,,Ds + 22A$ = 0. 


If a = £8, then (2.6) is equivalent to (2.4); and the other equations (2.6) are 
obvious. We then multiply the equations, 


ODa 


8 
= Dasla 
ag Pat%e» 


through by A‘ , sum on a, and make use of (2.5) and (2.6), getting 


(2.7) Ag ®P: _ DF pe + Fe) = 0. 

ov* 
If in particular D, ¥ 0, the equations (2.7) with 8 = 1 imply the remainder. 
For it follows from (2.6) that any vector A}, --- , A} ', Ds (8 ¥ 1) is either 
a zero vector or proportional to Aj,---,A]', D,. Let us associate with 
each point of the characteristic edge (2.1) the integral element containing the 
tangent plane edge. Then any set of (2n + 1) functions, 


(2.8) x* = §(v), z = ¢(v), Pa = Tal), 
with D, # 0, which satisfy the system’* 
(a) F,, Da = 9, (b) 2. — Pers = 0, 
(ce) Aj -** — DF .pa + Fx) = 0, 
ov* 
will be called an (n — 1)-dimensional characteristic strip. 
We have immediately that over an (n — 1)-dimensional characteristic strip 


the function F = constant. For if we multiply the equation (2.9c) through by 
zy, sum on @ and make use of the equation (2.9b) we get 


22 At °P* _ DAF. pe + Fee) 28 =0. 
av’ 
This becomes 
ak 
D, =, = 0, 
aw 
when use is made of (2.6) and (2.9b). 


* In connection with (2.9a) let us notice that an edge lying on an integral hypersurface 
of a second order equation which satisfies the condition Fy,,Da Dg = 0 is the well-known 
characteristic surface. Let us also notice that in case n = 2 the system (2.9) reduces 


to (1.4). 
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3. Now let us show that an (n — 1)-dimensional characteristic strip is com- 
posed of an (n — 2)-parameter family of one-dimensional characteristic strips. 
In order to show that each one-dimensional characteristic strip determined by an 
element of our (x — 1)-dimensional characteristic strip lies entirely on our 
(n — 1)-dimensional strip, we consider the system of ordinary equations 

dv" Aj 
(3.1) dot = ~ Dy ’ 


where the right members are functions of v* alone in virtue of (2.8). By the 
existence theorem for ordinary equations the system (3.1) has a unique solution, 


(3.2) vo” = V*(o' | v9), 


which reduces to vj for a’ = 0. The set of (2n + 1) functions of 5’ obtained 

by substituting (3.2) into (2.8) constitutes a one-dimensional characteristic 

strip. For multiplying the equations (3.1) by xf , summing on a, and making 

use of (2.9a) or (2.6) we find that the first n equations (1.4) are satisfied. The 

next equation (1.4) is a consequence of the first n and the equation (2.9b). 

The remaining equations (1.4) follow from (2.9c) and the equations (3.1). 
Now let us consider any set of (n — 1) functions, 


(3.3) vo = (0, --- , 5"), 
which for the set of values 0’ = 0 make the determinant 


Aj[vo(0)] --- At *[vs(0)] 


avi (0) uy ~(0) 

(3.4) a0" a0" ~ 0. 
av3(0) av) '(0) 
ap" ap 


On account of (3.4) the result of replacing the arbitrary constants vp in (3.2) 
by (3.3), namely, v* = v*(8”), can be regarded as a change of parameter in the 
neighborhood of the set of values «*° = 0. In the transformed strip (2.8) the 
one-dimensional strips along which #' varies are characteristic. Let the result 
of setting i’ = 0 in the transformed strip (2.8) be denoted by 


(3.5) roe, a% 5 0”); zo(0", oo yD" >; Paol®, -0+ 8"). 


In the neighborhood of the set of values 6° = 0 the set of functions (3.5) satisfy 
the conditions 


(a) Flae | 20| paol = eonst., 
(3.6) O20 . arp 
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and make the rank of the matrix 


Ping F ons 
ax aay 
(3.7) ae" ab" 
OXo , axe 

op" op"! 


equal to (n — 1). 

Conversely, let us show that through any (n — 2)-dimensional strip (3.5) 
satisfying the conditions (3.6) and (3.7) there passes a unique (n — 1)-dimensional 
characteristic strip. Suppose that we have an integral of the system (1.4) 
depending on an auxiliary variable 3' and (2n + 1) arbitrary constants, 2%, 
20, Pao Which reduces to xg , 2 , Pao for b| = 0. Let the result of substituting 
the (2n + 1) functions (3.5) into this integral be denoted by S; we must show 
that S is an (nm — 1)-dimensional characteristic strip. S satisfies the equation 
(2.9a) as a direct consequence of the first n equations (1.4). Alse on account 
of the same equations (1.4) the quantities A% which do not vanish identically 
will be Al. In particular, let us consider Alt # 0. Since A} = —D,, the 
equations (2.9¢) are identical with the last n equations (1.4). When a = 1, 
the equation (2.9b) follows from the first (n + 1) equations (1.4). In order 
to show that the remaining equations (2.9b) are satisfied we put 


Vi = 2% — Peli - 


Differentiate this relation with respect to d', make use of (2.9b) for a = 1 and 
equations (1.4), and obtain 


av, _ FP OPa 


(3.8 _ P 
) ao! * ao 


+ F.pax? + Fyax; . 


From (3.6a) we have dF /a0' = 0, and equations (3.8) become 


Vs py, 
a0" 
Since V; = Ofora' = 0 by (3.6b), we have V; = 0. 

The proof that the strip S is unique consists in showing that any other 
(n — 1)-dimensional characteristic strip S* containing the (n — 2)-dimensional 
strip (3.5) must contain all of the (n — 2)-parameter family of one-dimensional 
characteristic strips which constitute S. By assumption, the initial element 
of any one of the one-dimensional characteristic strips in S lies also on S*. 
By the argument in the first part of §3, the strip S* consists of an (n — 2)- 
parameter family of one-dimensional characteristic strips. Since a one-dimen- 
sional characteristic strip is determined by its initial element, the argument is 
complete. 

The construction of our (n — 1)-dimensional characteristic strip can be given 
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a geometrical interpretation. Let an initial (mn — 2)-dimensional manifold m 
be given by the first (n + 1) equations (3.5). Suppose that there exists an 
element E which contains the (n — 2)-spread | tangent to m at P (#' = 0) and 
which is tangent to the hypercone 7’ with vertex at P. Suppose also that / 
does not contain the generator through P determined by E. Then the equa- 
tions (3.6) determine a one-parameter family of strips (3.5) containing the 
manifold m. Then as E describes one of these strips containing m, the (nm — 2)- 
parameter family of one-dimensional characteristic strips thus determined con- 
stitutes the (n — 1)-dimensional characteristic strip. 


4. Let us recall that Cauchy’s method of integrating the equation (a) is to 
replace the arbitrary constants in a solution of (1.4) by (2n + 1) functions of 
(n — 1) parameters, namely, 

x = E*(v", wee, v"); Za = E(v", ror it | 


(4.1) 


9 
Pa = Fal, a ), 
satisfying the conditions 


Fixe | 20 | Pao) = 0, 


(4.2) az ore 
_ 0 = a 
3 o Pad ; , (o — 2, ; n). 


We now make a few remarks to show how the above theory can be applied 
in a discussion of Cauchy’s problem. Let the initial edge M be given by the 
first (n + 1) equations (4.1) and let us suppose that there exists a set of values 
Pao Which satisfy the system (4.2) at some point P(vj) of M. First consider the 
case in which 
(4.3) F,,Da #0 


for the set of values pao, E*(v5), E(vs). Henee we can solve equations (4.2) 
for the quantities pao as functions of v°, --- ,v". On account of (4.3) the strip 
thus obtained will give us a solution of Cauchy's problem in the form (1.1). 
xeometrically we have assumed that there exists an element 2, which contains 
the plane edge L tangent to M at P, and which is tangent to the hypercone 7 
with vertex at P. We have supposed also that L does not contain the generator 
through P determined by FE. Then the (xm — 1)-parameter family of one- 
dimensional characteristic strips, determined by E as P describes a neighbor- 
hood of itself on M, comprise our integral hypersurface. This integral hyper- 
surface can also be thought of as generated by the one-parameter family of 
(n — 1)-dimensional characteristic strips determined by any one-parameter 
family of (n — 2)-dimensional spreads lying on the edge M. 

Next let us consider the case where M bears an (n — 1)-dimensional char- 
acteristic strip (4.1) and Cauchy’s problem becomes indeterminate. Select any 
(nm — 2)-dimensional spread m lying on M such that at any point P of m the 
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(n — 2)-spread | tangent to m at P does not contain the generator through P 
determined by E. Through P on m pass a line t, on E, which with | determines 
a plane edge that does not contain the generator determined by EZ. Let T 
be a curve through P with ¢ for its tangent line. Select a differentiable (n — 2)- 
parameter family of such curves I’, one through each point of m, which taken 
together comprise an edge Yt. The plane edge tangent to Mt at any point P 
of m lies in E but does not contain the generator through P determined by E. 
Therefore the integral hypersurface passing through Yt? must contain the 
(n — 1)-dimensional characteristic strip through m, i.e., the strip (4.1). 

Any edge M lying on an integral hypersurface, whose tangent plane edge L 
at each point P contains the generator through P determined by E but which 
does not bear an (n — 1)-dimensional characteristic strip, must contain singu- 
larities of the integral hypersurface. For if the integral hypersurface possessed 
second derivatives at each point of M, we should conclude by the argument 
in §2 that M bears an (n — 1)-dimensional characteristic strip. 
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THEOREMS ON FOURIER SERIES AND POWER SERIES 
By H. R. Pitr 


1. Introduction 
1.1. Notation. If p > 0, we write 


Srlod = (Hla) 


Tale} = ( I (Ze! vy de) 
Mm, [F()] = ( [ \F@ |? ao) 


If g(z) is a regular analytic function for | z| < 1, we write 


r l/p 
S,lo(2 = tim ( [ jgtre") rae)” 


(The limit exists, since the expression in the bracket increases with r.) 
1.2. Suppose that F(6) is periodic and integrable and that g(z) is regular in 
iz| <1. Let 


(1.2.1) F(@) ~ > a,e"" (ao = 0), 
(1.2.2) g(z) = Do ene” (jz| <2). 


We shall prove that if p, q, @ satisfy certain conditions, 


(1.2.3) Salen] S KH,[9(z)(1 — 2)", 
(1.2.4) S,la,n] < KM,[F(0)e*), 
where 
(1.2.5) ei 8 re Bide Si 

p q 


and the constants K(p, q, a) are independent of g(z) and F(@). 
Special cases of these inequalities, due to Hausdorff’ and Hardy and Little- 


Received October 16, 1937. 
' Hausdorff [5], Theorem Il. (Numbers in brackets refer to the references at the end 
of the paper.) This is the case a = 7 = 0 of (1.2.4). 
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2 ‘er e ry? . 
wood, are well known. They can be derived from Theorems 1 and 2 of this 
paper by substituting appropriate values of p, q and a. 


2. A theorem for power series 


2.1. Our principal result is as follows. 
THEorREM 1. Let 


g(z) = De ene" (jz| < 1); 
x >q2p>d0 a 20, 
p 
Then 
(A) Sle. n < K®,[g(2)0 — z)"), 
(B) T,[e.n "| < KH,[g(z)( — z)"), 


whenever the right side is finite. 

We shall denote the inequalities (A) and (B), for particular values of p, q 
and a, by A[p, q, a] and B[p, q, a], and use the symbol > to show relations 
of inclusion between them. For example, we write 


Alp, q, a] > Bir, s, 8], 


if B[r, s, 8] can be deduced from A[p, q, a]. 
2.2. Lemma 1. 


(a) Let 
x 2>q2l, a<l, qb <1; 
—" a+b—1 ~~ Bane 
" y=1 ¥*(n — ve 
Then 
Salen] 4 K(a, b, q) S,[b,]. 
(b) Let 
r >. > i. ees, sen. 
8 8 
ititen 5. 2% 
8 r 


—1 
> bb,» 


Cc, =n - " 
: v=1 y*(n _ v)* 


2 Hardy and Littlewood [1] and [4]. The former covers the cases a = 0 or y = 0, while 
the latter deals with the case a = 1/p’, \ = 1/q of (1.2.3). 








lis 


11s 


hile 
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Then 


S,len] S K(a, r,s) Si [b,. 


These results are special cases of a very general inequality of Hardy and 
Littlewood, Theorem 1 of [2]. We obtain (a) (with b), , c, instead of b, , en) 
by writing 


p= *%, r= q, a= 4, p=b+", y=atb+7—1, 


1 1 
p= ¢ > 6, a=B=a+-, 7 = = = ¢, 
8 r 
a, = b, = b,n™, Ch =c,n 
2.3. LemMMA 2. 
(a) Alp, q, 0] is true of 
~>q2p>rl, cers 
pq 
l £é. : 
(b) A] p,g, 1 —- — is true if 
P 4 
1 1 
~>q2p>l, -+-<1 
Pp @q 


These results follow at once from Theorems 9 and 10 of [1]. In fact, the 
latter results are true for general Fourier series, whereas we are concerned here 
only with Fourier power series. 

2.4. Lemma 3. 

Alp, q, a] > Bip, q, a] fq>1; 
B{p, q, a] > Alp, q, a] fq sl. 


These are immediate consequences of the following inequalities of Hardy and 
: 3 
Littlewood. 


Tole] < K(q)S,[e.n ifq>1; 
S [ler "| < K(q)T len] ifqg sl. 


2.5. Lemma 4. If x 2q21,% >p>0O,€ >0,a 20, then 
Alp, q, a] D Alp, gq, a + «]. 
8 Hardy and Littlewood [3], Theorems 3 and 11. (There is a misprint in Theorem 11. 


The exponent (p + q — pq)/q on the left of (4.11) should be negative.) The case g = 1 
is trivial. 
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Let 
¢(z) = > n*'2” (jz| < 1). 
Then‘ ¢(z) is regular for |z| < 1, z ¥ 1, and has no zero, except at z = 0, in 


|z| S$ 1. Moreover, | ¢(z)(1 — z)‘ |" is bounded in |}z| S$ 1. Hence, if we 
write 


zw(z) = g(z)/¢(z) = >> bn 2” (jz| < 1), 
1 
we have 
ae n—l i. 
= & yl-e ’ 


H,[w(z)2(1 — z)*] = K(e)H,[g(z)(1 — 2)*"'). 
We can write 


n—l —X 

= oe b,(n — 
c,.n A—e =n A—e n—o(9 v) P 
y= v(m — v) 


and since 
1 
l—e<l, —-rA = —-a--< 


it follows from Lemma 1(a) that 
S,[e.n ‘| s KS,[b.n”). 
Hence, assuming A[p, 9, a], we have 
Sle.n ‘| S KS,[b.n *) S KH,[w(z)2(1 — z)*] Ss KH,[g(z)( — z)*"). 


This gives A[p, qg, a + él. 
2.6. Lemma 5. Let 


(2.6.1) q> 1, a> i, s2p>0, a 20, 
: —-1s . s . 
5 . a 

Then 


Alp, 8, a] D A[4hp, q, 2a]. 


By using « theorem of F. Riesz,’ we may suppose that g(z) has no zeros in 
z < |, exeept at z = 0, and write 


2g(z) = [w(2), wz) = Dbz’, 
1 


* See Hardy and Littlewood [4], page 467 
°F. Riesz (6). See also page 207 of [1] 





s in 
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so that 


n—l 
C1 = } b,b,_,. 


v=l 


We suppose that A[p, s, a] is true. Then 


(2.6.2) S,.[b.n “*] s K9,[o(z)(1 _ z)*], 
where 
1 
ae-+ : +a-1 
Se 
Let 
r»=2 + an 2a — 1 
P @q 
Then 
= =" < bv “b,_,.(n — v)* 
Chin =n ———_—__ —__ —___— —_ 


= y(n —v)* 


It is plain from (2.6.1) that 
. OsrA-2&= 


It follows from Lemma 1(b) that 
Selena] S KSi[b.n“] S KH} [w(2)(1 — 2)%), 
by (2.6.2). Hence 
Salenan”] S KGHjp[w(2)*(1 — 2)**] = KGHpolg@)C — 2)**I, 


which is equivalent to A[}p, q, 2a]. 
2.7. Lemma 6. If © >q > p> 0,a 2 0, then 


Bp, 9, a] > B[}p, 44, 2a]. 


As in Lemma 5, we may® suppose that g(z) 0 for |z| < 1, z ¥ 0, and write 


2 n—l 
zg(z) = [w(z)]’, w(z) = >> b,2’, a.10 2 SA... 


v=l v=] 
We suppose that B[p, q, a] is true, so that 
T [bon] < KH,[o(2)(1 — 2)*], 
where 


l l 
w= -+-+a-— 1. 
Pq 


®* We use the fact that W/o + ¥] S A(r)[M.-(e) + W(¥)] for any functions ¢, ¢ and 
any positive r. If r 2 1, this is Minkowski’s inequality. If r < 1, it follows from the 
inequalities a + b S (at + br)" S 2F(a + b) 











Let 


A= “+ : + 2a — 1. 
Then 
oo oo n—1 
a ae oe Yaa TT) ee 
n=1 n=1 v=1 


x a 


= bi ~ |b, | xz | bn | 2"(n + y) tHe 


n=1 v=1 


< (= |b, | evo), 


v=] 
since 
A+ 1 — 4/q = 24u+ 1 — 2/q) 20 


and (a + b)’ = ab for any positive a, b. Hence 


Tyoleran 4) S Tilban*""] < KG; [o(e)(1 — 2)*] 
= K§,,[o(2)"(1 — 2)""] = KSplg@)201 — 2)", 


and this is equivalent to B[}p, }q, 2a]. 

2.8. Proof of Theorem 1. We suppose first that » > q 2p> 1. In view 
of Lemma 3, it is sufficient to prove A[p, q, a], and this follows at once from 
Lemmas 2 and 4. 

Next, let « >¢ >12p>0. Because of Lemmas 3 and 4, it is sufficient 
to prove A[p, g, 0]. We suppose first that g > 1 2 p> 3. We can choose s 
so that 


iucS G-4+8 cere: 
Pp q q Dp 
Then 
ae x s2 2p>1. 
S q 8 
so that 


A[2p, s, 0] > Alp, 4, 9}, 


by Lemma 5. Moreover, since 
; l 
“o>s22n>1, 4 > 1, 
8 


it follows from Lemma 2(a) that A[2p, s, 0] is true. Hence A[p, q, 0] is true 
forg > 12 p> }. If we now put s = q in Lemma 5, we can prove that 


Alp, q, 0] is true for 
bh el 
9 = q° ge HP one 


successively, and it follows that the result is true generally forg > 1 2 p > 0. 














rue 
hat 





THEOREMS ON FOURIER SERIES AND POWER SERIES 753 


Finally, we have to consider the case 1 2 q 2 p> 0. Because of Lemma 3, 
it is sufficient to prove B[p, q, a]. We choose an integer k so that 2 = 2*q > 1. 
Then B[2‘ p, 2*¢q, a] is true for a = 0, by what we have already proved, and the 
conclusion follows by repeated application of Lemma 6. This completes the 
proof of Theorem 1. 

2.9. We can express Theorem 1 in a slightly different form. We know’ 
that if ,[g(z)(1 — z)*] is finite, then g(z) has a boundary function G(@) such 
that the ratio 


Hplg(z)(1 — z)*]: Mt [GE (0)e*] 


lies between positive bounds K(p, a). Conversely, if It,[G(6)6*] is finite, then 
(7(@) is the boundary function of an analytic function g(z), and the same relation 
holds. It follows that we may replace §,[g(z)(1 — z)“] by 9,[G(@)é*] in the 
conclusion of Theorem 1. 


3. Theorems for Fourier series 


3.1. We shall now show that the inequality (1.2.4) can be deduced from 
Theorem 1 when p, g and a@ satisfy certain extra conditions. 
THEOREM 2. Suppose that F(@) is integrable and periodic. Let 


F@) ~ dD a,e"”, ad = 0; 
1 
x >q2p>l, —=>a2z0, 


P 


nto thew tee 
Pp 4 


Then 
(a) SJa,n*] < KM,[F(0)e*), 
(b) M,[F(0)e |} < KS,la, n°"). 


Let G(@) be the conjugate of F(@). Then F(@) + 7G(@) and F(@) — iG(@) 
are boundary functions of 2 > a,2” and 2 > a_,2", respectively, and the conclu- 
sion (a) follows from Wena 1 and the aati of 2.9 if we show that 

M[G(e)e"] < KAM,[F(e)e"). 


This has been proved by Hardy and Littlewood* when p > 1, —1, p <a <1 p’, 
and these conditions are plainly satisfied here. 
The inequality (b) ean be deduced from (a) by a“‘conjugaecy” argument. Let 


Q(0) = a ed 


7. Riesz [6], Theorems Il and IIl. See also $4.1 of [4] 
S Hardy and Littlewood [4], $6.3. 
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be a polynomial. Then 


[ro Q(8) dé = Lanta, 


II 


| do a,.n*c,n* | 
S,[a,, n*] Splen n “| 


< Sa, n|1KM,-[Q() ) 


| / ‘ F(0)@~* Q(8) & do 


lA 


by Theorem 2(a), since 
a=F+i+h-120, ->rA 20, o>p2q'>1. 
D 


It follows from the converse of Hélder’s inequality that 


M [Foe] < KS,[a,n*]. 
This is what we require. 
3.2. We can deduce from Theorem | the following extensions of Theorems 
9 and 10 of [4]. 
THEOREM 3. Suppose that G(@) is the boundary function of an analytic func- 


tion g(z2) = >. cnz", that 
1 
s,(z) = b c,e"": 
1 
and that p, 4, a satisfy the conditions of Theorem 1. Then 


S|(s.(xz) — s(x))n |] S KM,[(F(x + 6) — s(x))0*™', 


whenever the right side is finite. 
THeoreM 4. Suppose that F(@) is integrable and 


i 
F(@) ~ be a,e""", ay = 0. 


Let 


8,(z) = > ad. 
g(z, 6) = 3[F(z + 0) + F(z — 0) — 28(z)]; 


“z>qazprl, 








orems 


func- 
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Then 


Sl(sn(xz) — s(x))n™] S KM,lo(x, 06°"), 


whenever the right side is finite. 
We obtain Theorem 3 immediately on applying Theorem 1 to the function 
(g(z) — s(0))(1 — 2)". Theorem 4 follows from Theorem 1 by the argument 
given in §5.2 of [4]. 
The Hardy-Littlewood theorems are given, in each case, by a = 1/p’. 
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